MANNOSIDASE STRUCTURES



A portion of the disclosure of this patent document contains material that is subject to 
copyright protection. The copyright owner has no objection to the facsimile reproduction by 
anyone of the patent document or patent disclosure, as it appears in the Patent and Trademark
Office patent file or records, but otherwise reserves all copyright rights whatsoever. 

FIELD OF THE INVENTION 

The present invention relates to crystal structures. In particular, the invention relates to crystals
comprising a mannosidase II ligand binding domain (LBD), optionally having a ligand which
is associated therewith. The structures may be used to determine mannosidase homologues
and information about the secondary and tertiary structures of polypeptides which are as yet
structurally uncharacterised. The structures may also be used to identify ligands which are
capable of binding the ligand binding domain. Such ligands may be capable of acting as
modulators of mannosidase II activity.

BACKGROUND 

Mannosidase II enzymes

There has been widespread interest in mannosidases in recent years, largely due to their role 
in a multitude of biological systems and, as a result, their potential as therapeutic targets. In 
particular, mammalian Golgi α-mannosidase II is involved in glycoprotein biosynthesis
(especially in the maturation of N-linked oligosaccharides on newly synthesized
glycoproteins) and is currently an important therapeutic target for the development of
anti-cancer agents (Goss et al (1995) Clin. Cancer Res. 1:935-944).

Golgi α-mannosidase II (mannosyl oligosaccharide 1,3-1,6-α-mannosidase II, EC 3.2.1.114;
also referred to herein as "GMII") belongs to the glycosyl hydrolase family 38 (Henrissat,
1991; Coutinho and Henrissat, 1999) and is central to the Golgi processing pathway, as it
specifically trims two mannose residues from the branched GlcNAcMan5GlcNAc2 mannose
intermediate (Figure 8A) to form the core GlcNAcMan3GlcNAc2 glycosyl structure, an
essential precursor for the further addition of N-acetyl-glucosamine units. GMII is a Type II
transmembrane protein, approximately 125 kD in size, composed of a short N-terminal
cytoplasmic tail, a single-span transmembrane domain and a large lumenal C-terminal
catalytic portion (Moremen and Touster, 1985, 1986). The enzyme is highly specific for the
presence of the single GlcNAc attached in a β1,2 linkage to the Man α1,3-Man arm of the
GlcNAcMan5GlcNAc2-Asn-X substrate (Harpaz and Schachter, 1980). It removes the
di-mannose branch (M6, M7; Figure 8A) by hydrolysis of both glycosidic bonds with net
retention of sugar anomeric configuration, resulting in the final tri-mannose
GlcNAcMan3GlcNAc2 core. There is little or no experimental evidence to date addressing
whether the two bonds are cleaved in separate binding sites or sequentially in the same
binding site, or whether the singly-hydrolyzed product is released from the enzyme
between the two cleavage events.

Mammalian lysosomal α-mannosidase has significant sequence similarity to the GMII enzyme
and is responsible for glycoprotein degradation (Moremen et al (1994) Glycobiology 4:113-125;
Liao et al (1996) J. Biol. Chem. 271:28348-28358). In particular, lysosomal
α-mannosidase II is involved in the catabolism of N-linked glycoproteins through the sequential
degradation of high mannose, hybrid and complex oligosaccharides.


Mutations in the gene encoding mannosidase II cause α-mannosidosis, an autosomal recessive
lysosomal storage disease (Ockermann (1967) Lancet 2:239-241). 

A number of mannosidase II genes have been characterised from different sources, including 
the Drosophila gene (Foster et al (1995) Gene 154:183-186; Rabouille et al (1999) J. Cell Sci.
112:3319-3330), rat gene (Spiro et al (1997) J. Biol. Chem. 272:29356-29363) and human,
mouse, bovine and feline genes (Beccari et al (1999) Bioscience Reports 19:158-162). These
mannosidases have been categorized as class II mannosidases, based on sequence alignment,
and belong to family 38 in Henrissat's glycosidase classification (Moremen et al (1994) as
above, Henrissat and Bairoch (1996) Biochem J. 316:695-696).

To date there have been significant problems with high level expression of these enzymes,
which has impeded structural and mechanistic studies. Indeed, problems with expression
have meant that α-mannosidase from Jack Bean (Canavalia ensiformis) has been used as a
model enzyme for structural and functional characterisation (Howard et al (1998) J. Biol.
Chem. 273:2067-2072; Kimura et al (1999) Eur. J. Biochem. 164:168-175). In view of the
potential therapeutic application of mannosidase inhibitors, there is a need for direct structural 
characterisation of these enzymes. 

Swainsonine

Swainsonine (SW) is an indolizidine alkaloid found in Australian Swainsona canescens 
(Colegate et al., Aust J Chem 32:2257-2264, 1979), North American plants of the genera
Astragalus and (Molyneux R J and James L F., Science 215:190-191, 1981), and also the
fungus Rhizoctonia leguminicola (Schneider et al., Tetrahedron 39:29-31, 1983).


Swainsonine is a potent and specific inhibitor of the lysosomal and Golgi forms of
α-mannosidase (Cenci di Bello et al., Biochem. J. 215, 693 (1983); Tulsiani et al., J. Biol.
Chem. 257, 7936 (1982)). It has potential therapeutic value as an antimetastatic (Humphries
et al., Cancer Res. 48, 1410 (1988)), anti-tumor-proliferative (Dennis, Cancer Res. 46, 5131
(1986)), or immunoregulatory agent (Kino et al., J. Antibiot. 38, 936 (1985)). Swainsonine
has also been shown to have positive effects on cellular immunity in mice (reviewed in
Humphries M. J. and Olden K., Pharmacol Ther. 44:85-105, 1989, and Olden et al.,
Pharmacol Ther 50:285-290, 1991).

Structural information about the interaction between swainsonine and mannosidase II
enzymes would provide a basis for rational modification of swainsonine derivatives with 
altered activities. It would also provide a framework on which new ligands could be designed 
which mimic some of the swainsonine:mannosidase atomic interactions. 

SUMMARY OF THE INVENTION



The present invention is based on the finding that, after extensive modifications to the
protocol, it is possible to express mannosidase II in appreciable quantities. The present
invention is also based on the finding that it is possible to crystallize the protein mannosidase
II, both alone and in combination with a selection of different ligands. More particularly, it
has been possible to identify the specific sites of mannosidase II which are associated with
binding to swainsonine and the mannose-like compound deoxymannojirimycin (DMNJ). The
structure was also shown to exhibit a previously unobserved folding pattern, enabling the
design of novel GMII-specific inhibitors.

Binding domains are of significant utility in drug discovery. The association of natural ligands
and substrates with the binding domains of mannosidases is the basis of many biological
mechanisms. In addition, many drugs (e.g. swainsonine) exert their effects through
association with the binding domains of mannosidases. The associations may occur with all or
any parts of a binding domain. An understanding of these associations will lead to the design
and optimization of drugs having more favorable associations with their target enzyme and
thus provide improved biological effects. Therefore, information about the shape and structure
of mannosidases and their ligand-binding domains is invaluable in designing potential
modulators of mannosidases for use in treating diseases and conditions associated with or
modulated by the mannosidases.

Thus, according to a first aspect of the invention, there is provided a crystal comprising a 
mannosidase II ligand-binding domain. In a preferred embodiment the crystal is a crystal of a 
mannosidase II enzyme. The structure of a crystal of mannosidase II has been solved and is
set forth in Table 1, Table 2, or Table 8. 

The crystal may comprise a complex between a mannosidase II ligand-binding domain and at 
least one ligand, for example an inhibitor of mannosidase II. In a particularly preferred 
embodiment the crystal comprises a complex between mannosidase II and swainsonine. The
structure of a crystal of a complex between mannosidase II and swainsonine has been solved,
and is set forth in Table 2 or Table 8. 

In a second aspect, the present invention provides a crystal comprising swainsonine or a 
derivative thereof. In a preferred embodiment, the crystal comprises a complex between
swainsonine (or a derivative thereof) and a mannosidase II ligand-binding domain. The 
structure of a crystal of a complex between mannosidase II and swainsonine has been solved, 
and is set forth in Table 2, or Table 8. 

According to a third aspect of the invention, there is provided a model of at least part of a
mannosidase II, made using a crystal according to the first aspect of the invention. In a 
preferred embodiment, the model comprises the mannosidase II ligand-binding domain. 
There is also provided a model of swainsonine or a derivative thereof made using a crystal 
according to the second aspect of the invention. 


The crystal of the first and second aspect of the invention and a model of the third aspect of 
the invention may be provided in the form of a computer readable medium. 

The crystals and models of earlier aspects of the invention may provide information about the 
atomic contacts involved in the interaction between the enzyme and a known ligand, which
can be used to screen for unknown ligands. According to a fourth aspect of the invention, 
there is provided a method of screening for a ligand capable of binding a mannosidase II 
ligand binding domain, comprising the use of a crystal according to the first or second aspects 
of the invention or a model according to the third aspect of the invention. For example, the 
method may comprise the step of contacting the ligand binding domain with a test compound,
and determining if said test compound binds to said ligand binding domain. 

In a fifth aspect, the present invention provides a ligand identified by a screening method of 
the fourth aspect of the invention. Preferably the ligand is a modulator that is capable of 
modulating the activity of a mannosidase II enzyme.

A crystal and/or model of the invention may be used to design, evaluate, and identify
modulators of a mannosidase II or homologues thereof other than ligands that associate with a
mannosidase II. The modulators may be based on the shape and structure of a mannosidase II, 
or a ligand binding domain or atomic interaction, or atomic contacts thereof. Therefore 
modulators may be derived from ligand binding domains or analogues or parts thereof. 

Modulators (e.g. ligands) which are capable of modulating the activity of mannosidase II 
enzymes have considerable therapeutic and prophylactic potential. In a sixth aspect, the 
present invention provides the use of a modulator of the invention in the manufacture of a 
medicament to treat and/or prevent a disease in a mammalian patient. There is also provided 
a pharmaceutical composition comprising a modulator and a method of treating and/or 
preventing a disease comprising the step of administering such a modulator or pharmaceutical 
composition to a mammalian patient. 

A potential modulator of a mannosidase II identified by a method of the present invention 
may be confirmed as a modulator by synthesizing the compound, and testing its effect on the 
enzymatic activity of mannosidase II in an assay. Such assays are known in the art. 

Therefore, the methods of the invention for identifying ligands or modulators may comprise 
one or more of the following additional steps: 

(a) testing whether the modulator or ligand is a modulator of the activity of a mannosidase 
II, preferably testing the activity of the modulator or ligand in cellular assays and 
animal model assays; 

(b) modifying the modulator or ligand; 

(c) optionally rerunning steps (a) or (b); and 

(d) preparing a pharmaceutical composition comprising the modulator or ligand. 

Steps (a), (b), (c) and (d) may be carried out in any order, at different points in time, and they
need not be sequential. 

The crystal structures and models described above also provide information about the
secondary and tertiary structure of mannosidase II enzymes. This can be used to glean
structural information about other, previously uncharacterised polypeptides. According to a 
seventh aspect of the invention there is provided a method of determining the secondary 
and/or tertiary structures of polypeptides with unknown (or only partially known) structure 
comprising the step of using such a crystal or model. The polypeptide under investigation is 
preferably structurally or functionally related to the mannosidase II enzyme. For example, the 
polypeptide may show a degree of homology over some or all parts of the primary amino acid 
sequence. Alternatively, the polypeptide may perform an analogous function or be suspected 
to show a similar catalytic mechanism to the mannosidase II enzyme. 

Aspects of the invention are presented in the accompanying claims and in the following 
description, drawings, and Tables. 

DESCRIPTION OF THE FIGURES AND TABLES 

The present invention will now be described only by way of example and with reference to 
the accompanying figures and tables, wherein: 

Figure 1 shows the active site of mannosidase II. 

Figure 2 shows the secondary structure of Drosophila Golgi α-mannosidase II. Helices are in
blue and β-sheets are in red.

Figure 3 shows the Drosophila Golgi α-mannosidase II molecule with the colours representing
where it is identical to human GMII. The red and blue represent deletions or insertions with 
respect to the human sequence. The green is a disulphide bond. 

Figure 4 shows the whole Drosophila Golgi α-mannosidase II molecule in sticks with residues
that are identical in the lysosomal ManII shown as coloured balls (red or blue depending on whether
they are in the N-terminal or C-terminal part of the molecule). 

Figure 5 shows the active site of a Drosophila mannosidase.

Figure 6 shows the DNA sequence of an expressed Drosophila mannosidase. 

Figure 7 shows an alignment of expressed secreted Drosophila mannosidase with human 
mannosidase. 

Figure 8 shows A) Schematic representation of the high mannose GlcNAcMan5GlcNAc2
substrate of dGMII. B) Ribbon representation of the dGMII structure, top-view, C) side-view.
The loop formed by residues 527-540 is shown in yellow. All molecular images were
prepared using MOLSCRIPT (Kraulis, 1991) and rendered using Raster3D (Merritt and
Bacon, 1997).

Figure 9 shows a molecular surface representation of the convex face (A) and the planar face 
(B) of the dGMII molecule. Molecular surface images are colored for electrostatic potential 
(red for negative, blue for positive). C) Molecular surface representation of the planar face of 
dGMII, colored for homology with the sequence of human Golgi α-mannosidase II
(dark-green for identical, light-green for homologous, yellow for similar, and white for different
residues). Alignment of human and Drosophila Golgi α-mannosidase II sequences (SwissProt
accession numbers Q16706 and Q24451, respectively) was performed using the GAP
program of the Wisconsin package (Version 10, Genetics Computer Group) using the default 
parameters without any manual intervention. The scores were used to colour the molecular 
surface. All molecular surface images were produced using GRASP (Nicholls et al., 1991). 



Figure 10 shows stereo views of the active site of dGMII with bound Tris (A), DMNJ (B), and 
swainsonine (C) molecules. The active site zinc ion is shown in turquoise, the bound inhibitor 
molecules are rendered in gold and water molecules are represented as transparent red
spheres. Hydrogen bonds are shown as blue dashed lines. 

Figure 11 shows A) Molecular surface representation of dGMII showing the position of the 
active site bound Tris molecule and the 2-methyl-2,4-pentanediol (MPD) binding site. B) 
Molecular surface representation of dGMII with the GlcNAcMan5GlcNAc2 substrate modeled
into the binding pocket. The substrate molecule is positioned into the binding pocket with
α1,6-linked mannose M6 (shown in green) docked into the active site and β1,2-GlcNAc
residue G3 (shown in black) placed in the MPD binding site. Individual mannose residues of
the substrate are colored according to the coloring scheme used in Figure 8A. C)
Representation of the sequential trimming of the α1,6 (M6) and α1,3-linked (M7) mannose
residues. Figure 11A was produced using LIGPLOT (Wallace et al., 1995). All molecular 
surface images were produced using GRASP (Nicholls et al., 1991). 

Table 1 shows the structural coordinates of a Drosophila Golgi α-mannosidase II.

Table 2 shows the structural coordinates of a Drosophila Golgi α-mannosidase II with
swainsonine. 

Table 3 shows the ligand binding domain (active site) of a mannosidase II. 

Table 4 shows the intermolecular contacts of a Drosophila Golgi α-mannosidase II
swainsonine complex. 

Table 5 shows crystallographic refinement statistics for the native Drosophila Golgi 
mannosidase II. 

Table 6 shows crystallographic refinement statistics for Drosophila Golgi mannosidase II 
associated with swainsonine. 

Table 7 shows a list of Mannosidase II enzymes.

Table 8 shows the structural coordinates of a Drosophila Golgi α-mannosidase II with
swainsonine, a zinc ion, Tris molecule and an N-glycan. 

Table 9 shows data collection statistics for MAD (Se-Met) of dGMII and native dGMII. 

Table 10 shows refinement statistics of dGMII, dGMII-swainsonine complex, and dGMII- 
DMNJ complex. 

In Tables 1, 2, and 8 from the left, the second column identifies the atom number; the third 
identifies the atom type; the fourth identifies the amino acid type; the sixth identifies the residue 
number; the seventh identifies the x coordinates; the eighth identifies the y coordinates; the 
ninth identifies the z coordinates; the tenth identifies the occupancy; and the eleventh identifies 
the temperature factor. 
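
By way of illustration only, the column layout described above could be read programmatically along the following lines. This is a minimal sketch that assumes the rows of Tables 1, 2, and 8 are whitespace-delimited; the names AtomRecord and parse_row are hypothetical, and the fifth column, which is not described above, is simply kept as raw text.

```python
from typing import NamedTuple


class AtomRecord(NamedTuple):
    """One coordinate row, following the column order described in the text."""
    atom_number: int            # second column
    atom_type: str              # third column
    residue_type: str           # fourth column
    column_five: str            # fifth column: not described in the text, kept verbatim
    residue_number: int         # sixth column
    x: float                    # seventh column
    y: float                    # eighth column
    z: float                    # ninth column
    occupancy: float            # tenth column
    temperature_factor: float   # eleventh column


def parse_row(row: str) -> AtomRecord:
    """Split a whitespace-delimited row (an assumption) into named fields."""
    fields = row.split()
    if len(fields) < 11:
        raise ValueError(f"expected at least 11 columns, got {len(fields)}")
    return AtomRecord(
        atom_number=int(fields[1]),
        atom_type=fields[2],
        residue_type=fields[3],
        column_five=fields[4],
        residue_number=int(fields[5]),
        x=float(fields[6]),
        y=float(fields[7]),
        z=float(fields[8]),
        occupancy=float(fields[9]),
        temperature_factor=float(fields[10]),
    )


# Made-up example row, not taken from any of the Tables.
example = parse_row("ATOM 1 N CYS A 31 10.000 20.000 30.000 1.00 25.00")
```

The first column is ignored here because the text does not say what it contains.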

DETAILED DESCRIPTION OF THE INVENTION 

Unless otherwise indicated, all terms used herein have the same meaning as they would to one 
skilled in the art of the present invention. Practitioners are particularly directed to Current 
Protocols in Molecular Biology (Ausubel) for definitions and terms of the art. Abbreviations
for amino acid residues are the standard 3-letter and/or 1-letter codes used in the art to refer to
one of the 20 common L-amino acids. 

In a first aspect, the present invention relates to a crystal comprising a mannosidase II ligand 
binding domain. 

Crystal 

As used herein, the term "crystal" means a structure (such as a three dimensional (3D) solid 
aggregate) in which the plane faces intersect at definite angles and in which there is a regular 
structure (such as internal structure) of the constituent chemical species. Thus, the term
"crystal" can include any one of: a solid physical crystal form such as an experimentally 
prepared crystal, a crystal structure derivable from the crystal (including secondary and/or 
tertiary and/or quaternary structural elements), a 2D and/or 3D model based on the crystal 
structure, a representation thereof such as a schematic representation thereof or a diagrammatic 
representation thereof, or a data set thereof for a computer. 

In one aspect, the crystal is usable in X-ray crystallography techniques. Here, the crystals used 
can withstand exposure to X-ray beams used to produce the diffraction pattern data necessary to
solve the X-ray crystallographic structure. A crystalline form of a mannosidase may be
characterized as being capable of diffracting X-rays in a pattern defined by one of the crystal
forms depicted in Blundell et al 1976, Protein Crystallography, Academic Press.

A crystal of the invention includes a mannosidase II or part thereof (e.g. ligand binding 
domain) in association with one or more moieties, including heavy-metal atoms i.e. a 
derivative crystal, a metal cofactor, or one or more ligands or substrates i.e. a co-crystal. 

The term "associate", "association" or "associating" refers to a condition of proximity 
between a moiety (i.e. chemical entity or compound or portions or fragments thereof), and a 
mannosidase II, or parts or fragments thereof (e.g. binding sites or domains). The association 
may be non-covalent i.e. where the juxtaposition is energetically favoured by for example, 
hydrogen-bonding, van der Waals, or electrostatic or hydrophobic interactions, or it may be 
covalent. 

The term "heavy-metal atoms" refers to an atom that can be used to solve an x-ray 
crystallography phase problem, including but not limited to a transition element, a lanthanide 
metal, or an actinide metal. Lanthanide metals include elements with atomic numbers between 
57 and 71, inclusive. Actinide metals include elements with atomic numbers between 89 and 
103, inclusive. 

Multiwavelength anomalous diffraction (MAD) phasing may be used to solve protein
structures using selenomethionyl (SeMet) proteins. Therefore, a complex of the invention may 
comprise a crystalline mannosidase II or part thereof (e.g. ligand binding domain) with 
selenium associated with the methionine residues of the protein. 


In an embodiment of the invention, a ligand binding domain is in association with a metal 
cofactor in the crystal. A "metal cofactor" refers to a metal required for mannosidase activity 
and/or stability. For example, the metal cofactor may be zinc or other similar atoms or
metals. In a preferred embodiment an LBD is in association with Zn2+.


A ligand binding domain in a complex with a cofactor preferably comprises one or more of 
the residues involved in coordination of a Zn2+ ion, namely: aspartate residues 92 and 204,
and histidines 90 and 471. 

The crystal may comprise a complex between a ligand-binding domain and one or more
ligands. In other words the ligand binding domain may be associated with one or more
ligands in the crystal. The ligand may be any compound which is capable of interacting
stably and specifically with the ligand binding domain. The ligand may, for example, be an
inhibitor of mannosidase II, including but not limited to swainsonine and the mannose-like
compound deoxymannojirimycin (DMNJ).

In a preferred embodiment the ligand associated with said mannosidase II ligand binding 
domain is swainsonine, or an analogue or derivative thereof. Swainsonine is an indolizidine 
alkaloid found in a variety of sources (Colegate et al, (1979); Molyneux and James (1981); 
and Schneider et al (1983) all as above) which has been known to be an inhibitor of
mannosidase II enzymes for some time. Derivatives of swainsonine are also known in the art;
for example US 5,962,467, US 5,650,413, and US 6,048,870 describe various derivatives of
swainsonine, processes for their preparation and their use as therapeutic agents. 

In an embodiment a crystal of the invention comprises a ligand binding domain of a
mannosidase II in association with swainsonine. These complexes may have the structural 
coordinates shown in Table 2, or Table 8. 

In a second aspect, the present invention also provides a crystal comprising swainsonine or a 
derivative thereof. Preferably the swainsonine molecule has the three dimensional structure 
defined by the relevant structural coordinates shown in Table 2, or Table 8. 

The crystal may also comprise a complex between mannosidase II (or part thereof) and a 
substrate, or analogue thereof. The term "substrate" refers to molecules that associate with a 
mannosidase II as it hydrolyzes linkages between mannose residues. Mannosidase II enzymes
release α-D-mannose as a first formed product and they follow a double-displacement
mechanism in which a glycosyl-enzyme intermediate is formed and hydrolyzed via 
oxocarbenium ion-like transition states. The formation of the intermediate is assisted by general 
acid catalysis from a carboxylic acid located in the active site. The residue also serves as the 
general base catalyst for the second deglycosylation step. A second carboxylic acid serves as the 
nucleophile that forms the covalent intermediate. Thus, the substrate molecule may comprise 
molecules such as the glycosyl moiety that forms an intermediate with the enzyme. (See 
Howard, S. et al, J. Biol. Chem. (1998) 273:2067-2072 and references 11, 12, 14, 15, and 16
therein). An analogue of a substrate is one which mimics the substrate binding in the LBD, but 
which is incapable (or has a significantly reduced capacity) to take part in the catalytic reaction. 

A number of substrates for Golgi α-mannosidase II are known including the artificial
substrate PNP-mannose (Rabouille et al (1999) as above). Lysosomal mannosidase II is
involved in glycoprotein degradation. In particular lysosomal mannosidase II hydrolyses
α(1,2), α(1,3) and α(1,6) linkages between mannose residues. Substrates for this enzyme are
thought to include high mannose, hybrid and complex oligosaccharides. 

In an embodiment, the substrate comprises GlcNAcMan5GlcNAc2-Asn-.

A complex may comprise one or more of the intermolecular interactions identified in Table 4.
A structure of a complex of the invention may be defined by selected intermolecular contacts, 
preferably the intermolecular contacts as defined in Table 4. 

A crystal of the invention may be characterized by an N-terminal α/β domain, a C-terminal
portion comprising a three-helical bundle, and an all-β C-terminal domain, connected by 5
internal disulfide bonds and stabilized by a zinc binding site (Figure 8B).

The N-terminal α/β domain is characterized as follows:
(a) comprising an inner core of three β-sheets (A, B and C, Figure 8B) consisting of 11,
mostly parallel β-strands, surrounded by 16 α-helices;

(b) comprising a GlcNAc residue at a consensus N-glycosylation site (Asn-194), located
at the N-terminus of helix 7.

(c) stabilized by three disulfide bonds: between Cys-31 and Cys-1032 connecting the N
and C-terminal extremes of dGMII; Cys-275 and Cys-282 linking helices 10 and 11;
Cys-283 and Cys-297 linking helix 11 with a loop between helix 13 and the core of
parallel β-sheets.

The C-terminal portion is characterized as follows:
(a) a three-helix bundle comprising helices 18, 20 and 21 connected to the N-terminal
α/β-domain via a zinc binding site.

(b) a zinc ion coordinated in a T5-square-based pyramidal geometry involving residues:
His-90, Asp-92, Asp-204 and His-471.

(c) two immunoglobulin-like domains: a small β-sandwich consisting of 12 anti-parallel
strands from β-sheets D and E, and a large 21-strand structure involving β-sheets F
and G.

(d) a barrel formed by the three-helix bundle, helix-23, and the two β-sandwich structures
providing a narrow pore in the center of the C-terminal domain.

The barrel in the C-terminal portion is lined by six arginine residues: Arg-540, 565, 617, 770,
777 and 893, contributing to the overall positive charge of the pore (Figure 9A). A hairpin
loop, connecting two strands of β-sheet D (Figure 8B and C, residues 527-540, shown in
yellow) protrudes into the center of the barrel on the planar side of the molecule. Arginine
residue 530, located at the tip of the type-I β-turn in this loop, plugs the pore preventing an
open channel through the protein. The resulting crater-like cavity on the convex side of the
molecule is 20 Å deep, with a diameter of 20 Å funneling to 8 Å at the bottom of the cavity.
The loop has a higher degree of flexibility compared to the rest of the structure (average
B-factor values: ~33 Å² and ~15 Å², respectively).

A crystal of the invention may enable the determination of structural data for a ligand or 
substrate. In order to be able to derive structural data for the ligand or substrate, it is necessary 
for the molecule to have sufficiently strong electron density to enable a model of the molecule 
to be built using standard techniques. For example, there should be sufficient electron density 
to allow a model to be built using XTALVIEW (McRee 1992 J. Mol. Graphics. 10:44-46).

Preferably, the crystal of the invention belongs to space group P2₁2₁2₁.

The term "space group" refers to the lattice and symmetry of the crystal. In a space group 
designation the capital letter indicates the lattice type and the other symbols represent 
symmetry operations that can be carried out on the contents of the asymmetric unit without 
changing its appearance. 

Preferably, a crystal of said complex comprises a unit cell having the following unit 
dimensions: a=69 (±5) Å, b=110 (±5) Å, c=139 (±5) Å.
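
As a hedged illustration, the stated tolerances can be checked for a measured cell as sketched below. The function names are hypothetical, and the volume calculation assumes that all three cell angles are 90 degrees, which is an assumption made here for simplicity rather than a statement taken from the passage above.

```python
# Preferred axial lengths (in Å) and tolerance, as given in the text.
PREFERRED_CELL = (69.0, 110.0, 139.0)
TOLERANCE = 5.0


def within_preferred_cell(a: float, b: float, c: float, tol: float = TOLERANCE) -> bool:
    """Return True if each axial length lies within +/- tol of the preferred value."""
    observed = (a, b, c)
    return all(abs(obs - ref) <= tol for obs, ref in zip(observed, PREFERRED_CELL))


def orthogonal_cell_volume(a: float, b: float, c: float) -> float:
    """Cell volume in cubic Å, valid only when all cell angles are 90 degrees (assumed)."""
    return a * b * c
```

For example, a hypothetical measured cell of 68.9 x 109.5 x 138.6 Å would pass the check, whereas one with a = 80 Å would not.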

The term "unit cell" refers to the smallest and simplest volume element (i.e.
parallelepiped-shaped block) of a crystal that is completely representative of the unit of pattern of the crystal.
The unit cell axial lengths are represented by a, b, and c. Those of skill in the art understand
that a set of atomic coordinates determined by X-ray crystallography is not without standard
error. 

In a highly preferred embodiment, the crystal comprises the structural coordinates as shown in 
Table 1, Table 2, or Table 8. 

As used herein, the term "structural coordinates" refers to a set of values that define the
position of one or more amino acid residues with reference to a system of axes. The term 
refers to a data set that defines the three dimensional structure of a molecule or molecules 
(e.g. Cartesian coordinates, temperature factors, and occupancies). Structural coordinates can 
be slightly modified and still render nearly identical three dimensional structures. A measure 
of a unique set of structural coordinates is the root-mean-square deviation of the resulting 
structure. Structural coordinates that render three dimensional structures (in particular a three 
dimensional structure of an SGC domain) that deviate from one another by a root-mean-square
deviation of less than 5 Å, 4 Å, 3 Å, 2 Å, or 1.5 Å may be viewed by a person of
ordinary skill in the art as very similar. 
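
The root-mean-square deviation referred to above can be sketched as follows. This minimal example assumes that the two coordinate sets list the same atoms in the same order and have already been superimposed; it does not perform any fitting, and the function name rmsd is chosen here for illustration.

```python
import math
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]


def rmsd(coords_a: Sequence[Vec3], coords_b: Sequence[Vec3]) -> float:
    """Root-mean-square deviation (same units as the input, e.g. Å).

    Assumes equal-length, atom-by-atom matched, pre-superimposed coordinates.
    """
    if len(coords_a) != len(coords_b) or len(coords_a) == 0:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))
```

The resulting value can then be compared against whichever of the cut-offs listed above (5, 4, 3, 2, or 1.5 Å) is being applied.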

Variations in structural coordinates may be generated because of mathematical manipulations 
of the structural coordinates of a mannosidase described herein. For example, the structural 
coordinates of Table 1, 2, or 8 may be manipulated by crystallographic permutations of the 
structural coordinates, fractionalization of the structural coordinates, integer additions or 
subtractions to sets of the structural coordinates, inversion of the structural coordinates or
any combination of the above. 
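
Two of the manipulations mentioned above, fractionalization and integer additions, can be illustrated with the sketch below. It assumes a cell whose angles are all 90 degrees, so that fractional coordinates are obtained by dividing each Cartesian coordinate by the corresponding axial length; a general cell would require a full transformation matrix. All function names are hypothetical.

```python
from typing import Tuple

Vec3 = Tuple[float, float, float]


def fractionalize(xyz: Vec3, cell: Vec3) -> Vec3:
    """Cartesian (Å) to fractional coordinates, assuming all cell angles are 90 degrees."""
    return tuple(coord / length for coord, length in zip(xyz, cell))


def orthogonalize(frac: Vec3, cell: Vec3) -> Vec3:
    """Fractional to Cartesian (Å) coordinates under the same assumption."""
    return tuple(f * length for f, length in zip(frac, cell))


def translate_by_cells(frac: Vec3, shift: Tuple[int, int, int]) -> Vec3:
    """Add whole unit cells to fractional coordinates (an 'integer addition')."""
    return tuple(f + n for f, n in zip(frac, shift))


# A point at the centre of a 69 x 110 x 139 Å cell, moved by one cell along a
# and minus one cell along c. The numbers are illustrative only.
cell = (69.0, 110.0, 139.0)
centre = fractionalize((34.5, 55.0, 69.5), cell)
moved = orthogonalize(translate_by_cells(centre, (1, 0, -1)), cell)
```

Such transformations change the numerical values of the coordinates without changing the relative positions of the atoms, which is why the resulting coordinate sets describe the same structure.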

Variations in the crystal structure due to mutations, additions, substitutions, and/or deletions 
of the amino acids, or other changes in any of the components that make up the crystal may 
also account for modifications in structural coordinates. If such modifications are within an 
acceptable standard error as compared to the original structural coordinates, the resulting 
structure may be the same. Therefore, a ligand that bound to a ligand binding domain of a 
mannosidase would also be expected to bind to another ligand binding domain whose 
structural coordinates defined a shape that fell within the acceptable error. Such modified 
structures of a ligand binding domain thereof are also within the scope of the invention. 

Various computational analyses may be used to determine whether a molecule or the ligand 
binding domain thereof is sufficiently similar to all or part of a mannosidase II ligand 
binding domain described herein. Such analyses may be carried out using conventional 
software applications and methods as described herein. 

The crystal may also be specifically characterised by the refinement statistics set out in Tables 
5, 6, or 10. 

MANNOSIDASE II 

The term "mannosidase II" refers to eukaryotic mannosidases involved in the biosynthesis of 
glycoproteins, glycolipids, glycosylphosphatidylinositols and other complex glycoconjugates, 
and prokaryotic mannosidases involved in the synthesis of carbohydrate structures of bacteria 
and viruses. In particular, the term refers to the class of mannosidases categorized as class II 
mannosidases, based on sequence alignment, belonging to family 38 in Henrissat's 
glycosidase classification (Moremen, K.W. et al (1994) GlycoBiology 4, 113-125; Henrissat, 
B. and Bairoch A. (1996) Biochem J. 316, 695-696; Henrissat, B. and Bairoch A. (1993) 
Biochem J. 293, 781-788; Henrissat, B. and Bairoch A. (1991) Biochem J. 280, 309-316). 
Examples of mannosidase II enzymes include those listed in Table 7 (from 
http://afmb.cnrs-mrs.fr/~pedro/CAZY/ghf_38.html). 

The invention generally relates to mannosidase II enzymes and parts thereof. Mannosidase II 
enzymes catalyze the first committed step in the biosynthesis of complex N-glycans and they 
control conversion of high mannose to complex N-glycans. 

Mannosidases are derivable from a variety of sources, including viruses, bacteria, fungi, 
plants, and animals. In a preferred embodiment the mannosidase is derivable from an 
animal, preferably a mammal including but not limited to bovine, ovine, porcine, murine, 
or equine, most preferably a human. The enzyme may be from any source, whether natural, 
synthetic, semi-synthetic, or recombinant. 

A mannosidase or part thereof in the present invention may be a wild type enzyme, or part 
thereof, or a mutant, variant or homologue of such an enzyme. 

The term "wild type" refers to a polypeptide having a primary amino acid sequence which is 
identical with the native enzyme (for example, the mammalian enzyme). 

The term "mutant" refers to a polypeptide having a primary amino acid sequence which 
differs from the wild type sequence by one or more amino acid additions, substitutions or 
deletions. Preferably, the mutant has at least 90% sequence identity with the wild type 
sequence. Preferably, the mutant has 20 mutations or less over the whole wild-type sequence. 
More preferably the mutant has 10 mutations or less, most preferably 5 mutations or less over 
the whole wild-type sequence. 

The term "variant" refers to a naturally occurring polypeptide which differs from a wild-type 
sequence. A variant may be found within the same species (i.e. if there is more than one 
isoform of the enzyme) or may be found within a different species. Preferably the variant has 
at least 90% sequence identity with the wild type sequence. Preferably, the variant has 20 
mutations or less over the whole wild-type sequence. More preferably, the variant has 10 
mutations or less, most preferably 5 mutations or less over the whole wild-type sequence. 

The term "part" indicates that the polypeptide comprises a fraction of the wild-type amino 
acid sequence. It may comprise one or more large contiguous sections of sequence or a 
plurality of small sections. In an embodiment, the "part" comprises a wild type mannosidase 
enzyme with the cytosolic and transmembrane domains and most of the stalk region 
eliminated; preferably the "part" comprises amino acid residues 31-1044 of Golgi α-mannosidase. 
The "part" may comprise a ligand binding domain as described herein. The 
polypeptide may also comprise other elements of sequence, for example, it may be a fusion 
protein with another protein (such as one which aids isolation or crystallisation of the 
polypeptide). Preferably the polypeptide comprises at least 50%, more preferably at least 
65%, most preferably at least 80% of the wild-type sequence. 

The term "homologue" means a polypeptide having a degree of homology with the wild-type 
amino acid sequence. The term "homology" can be equated with "identity". 

In the present context, a homologous sequence is taken to include an amino acid sequence 
which may be at least 75, 85 or 90% identical, preferably at least 95 or 98% identical to the 
wild-type sequence. Typically, the homologues will comprise the same sites (for example 
ligand binding domain) as the subject amino acid sequence. Although homology can also be 
considered in terms of similarity (i.e. amino acid residues having similar chemical 
properties/functions), in the context of the present invention it is preferred to express 
homology in terms of sequence identity. 

Homology comparisons can be conducted by eye, or more usually, with the aid of readily 
available sequence comparison programs. These commercially available computer programs 
can calculate % homology between two or more sequences. 

Percentage homology may be calculated over contiguous sequences, i.e. one sequence is 
aligned with the other sequence and each amino acid in one sequence is directly compared 
with the corresponding amino acid in the other sequence, one residue at a time. This is called 
an "ungapped" alignment. Typically, such ungapped alignments are performed only over a 
relatively short number of residues. 

Although this is a very simple and consistent method, it fails to take into consideration that, 
for example, in an otherwise identical pair of sequences, one insertion or deletion will cause 
the following amino acid residues to be put out of alignment, thus potentially resulting in a 
large reduction in % homology when a global alignment is performed. Consequently, most 
sequence comparison methods are designed to produce optimal alignments that take into 
consideration possible insertions and deletions without penalising unduly the overall 
homology score. This is achieved by inserting "gaps" in the sequence alignment to try to 
maximise local homology. 
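
The effect described above can be sketched in Python as follows. The example takes the sequence VFVVPHSHND, deletes one residue to give a hypothetical comparison sequence, and computes percentage identity first without and then with a gap. The choice of the alignment length as denominator is an assumption; alignment programs differ on this point.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of alignment columns holding identical residues.
    Both strings must already be aligned and of equal length; a column
    containing a gap character never counts as a match."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("aligned sequences must be non-empty and of equal length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap)
    return 100.0 * matches / len(aligned_a)


reference = "VFVVPHSHND"

# Hypothetical sequence with one residue deleted, compared residue by
# residue (the shorter sequence is padded at the end).
ungapped = percent_identity(reference, "VFVPHSHND-")

# The same sequence with a gap inserted at the site of the deletion.
gapped = percent_identity(reference, "VFV-PHSHND")
```

Here a single deletion reduces the ungapped identity to 30%, whereas inserting one gap restores it to 90%.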

However, these more complex methods assign "gap penalties" to each gap that occurs in the 
alignment so that, for the same number of identical amino acids, a sequence alignment with as 
few gaps as possible - reflecting higher relatedness between the two compared sequences - 
will achieve a higher score than one with many gaps. "Affine gap costs" are typically used 
that charge a relatively high cost for the existence of a gap and a smaller penalty for each 
subsequent residue in the gap. This is the most commonly used gap scoring system. High 
gap penalties will of course produce optimised alignments with fewer gaps. Most alignment 
programs allow the gap penalties to be modified. However, it is preferred to use the default 
values when using such software for sequence comparisons. For example when using the 
GCG Wisconsin Bestfit package the default gap penalty for amino acid sequences is -12 for a 
gap and -4 for each extension. 
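
A minimal Python sketch of affine gap scoring with these default values follows. It assumes the convention described above, in which the gap penalty covers the first residue of a gap and the extension penalty applies to each subsequent residue. Some programs instead charge the extension penalty for every residue in the gap, so the convention is an assumption rather than a statement about any particular package.

```python
GAP_OPEN = -12    # default penalty for a gap, as stated above
GAP_EXTEND = -4   # default penalty for each extension, as stated above


def affine_gap_score(gap_length, gap_open=GAP_OPEN, gap_extend=GAP_EXTEND):
    """Score contribution of a single gap spanning `gap_length` residues.
    Assumed convention: the opening penalty covers the first residue and
    the extension penalty is charged for each further residue."""
    if gap_length < 0:
        raise ValueError("gap length cannot be negative")
    if gap_length == 0:
        return 0
    return gap_open + gap_extend * (gap_length - 1)


# One gap of three residues versus three separate one-residue gaps.
one_long_gap = affine_gap_score(3)
three_short_gaps = 3 * affine_gap_score(1)
```

Under this convention a single three-residue gap scores -20 while three separate one-residue gaps score -36, which is why alignments with fewer gaps are favoured.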

Calculation of maximum % homology therefore firstly requires the production of an optimal 
alignment, taking into consideration gap penalties. A suitable computer program for carrying 
out such an alignment is the GCG Wisconsin Bestfit package (University of Wisconsin, 
U.S.A.; Devereux et al, 1984, Nucleic Acids Research 12:387). Examples of other software 
that can perform sequence comparisons include, but are not limited to, the BLAST package 
(see Ausubel et al, 1999 ibid - Chapter 18), FASTA (Altschul et al, 1990, J. Mol. Biol., 403-410) 
and the GENEWORKS suite of comparison tools. Both BLAST and FASTA are 
available for offline and online searching (see Ausubel et al, 1999 ibid, pages 7-58 to 7-60). 
However, for some applications, it is preferred to use the GCG Bestfit program. A new tool, 
called BLAST 2 Sequences, is also available for comparing protein and nucleotide sequences 
(see FEMS Microbiol Lett 1999 174(2): 247-50; FEMS Microbiol Lett 1999 177(1): 187-8 
and tatiana@ncbi.nlm.nih.gov). 

Although the final % homology can be measured in terms of identity, the alignment process 
itself is typically not based on an all-or-nothing pair comparison. Instead, a scaled similarity 
score matrix is generally used that assigns scores to each pairwise comparison based on 
chemical similarity or evolutionary distance. An example of such a matrix commonly used is 
the BLOSUM62 matrix - the default matrix for the BLAST suite of programs. GCG 
Wisconsin programs generally use either the public default values or a custom symbol 
comparison table if supplied (see user manual for further details). For some applications, it is 
preferred to use the public default values for the GCG package, or in the case of other 
software, the default matrix, such as BLOSUM62. 

Once the software has produced an optimal alignment, it is possible to calculate % homology, 
preferably % sequence identity. The software typically does this as part of the sequence 
comparison and generates a numerical result. 

The sequences may have deletions, insertions or substitutions of amino acid residues which 
produce a silent change and result in a functionally equivalent enzyme. Deliberate amino acid 
substitutions may be made on the basis of similarity in polarity, charge, solubility, 
hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues as long as the 
secondary binding activity of the substance is retained. For example, negatively charged 
amino acids include aspartic acid and glutamic acid; positively charged amino acids include 
lysine and arginine; and amino acids with uncharged polar head groups having similar 
hydrophilicity values include leucine, isoleucine, valine, glycine, alanine, asparagine, 
glutamine, serine, threonine, phenylalanine, and tyrosine. 
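
For illustration, the three groupings named in the preceding paragraph can be encoded as a simple lookup, as in the Python sketch below. The group labels, the function name `shared_group`, and the use of one-letter residue codes are choices made for this example only. Residues not named in the paragraph are deliberately left ungrouped.

```python
# Groups exactly as listed in the paragraph above, in one-letter codes.
RESIDUE_GROUPS = {
    "negatively charged": set("DE"),
    "positively charged": set("KR"),
    "uncharged, similar hydrophilicity": set("LIVGANQSTFY"),
}


def shared_group(res_a, res_b):
    """Return the name of the group containing both residues, or None
    if the two residues do not fall in the same listed group."""
    for name, members in RESIDUE_GROUPS.items():
        if res_a in members and res_b in members:
            return name
    return None


asp_to_glu = shared_group("D", "E")   # both negatively charged
lys_to_asp = shared_group("K", "D")   # opposite charges, no shared group
leu_to_ile = shared_group("L", "I")   # both in the third group
```

A substitution for which `shared_group` returns a group name corresponds to a like-for-like replacement in the sense used here.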

Conservative substitutions may be made, for example according to the Table below. Amino 
acids in the same block in the second column and preferably in the same line in the third 
column may be substituted for each other: 

The polypeptide may also have a homologous substitution (substitution and replacement are 
both used herein to mean the interchange of an existing amino acid residue, with an 
alternative residue) i.e. like-for-like substitution such as basic for basic, acidic for acidic, 
polar for polar etc. Non-homologous substitution may also occur i.e. from one class of 
residue to another or alternatively involving the inclusion of unnatural amino acids such as 
ornithine (hereinafter referred to as Z), diaminobutyric acid (hereinafter referred to 
as B), norleucine (hereinafter referred to as O), pyridylalanine, thienylalanine, 
naphthylalanine and phenylglycine. 

Replacements may also be made by unnatural amino acids, including: alpha* and 
alpha-disubstituted* amino acids, N-alkyl amino acids*, lactic acid*, halide derivatives of natural 
amino acids such as trifluorotyrosine*, p-Cl-phenylalanine*, p-Br-phenylalanine*, 
p-I-phenylalanine*, L-allyl-glycine*, β-alanine*, L-α-amino butyric acid*, L-γ-amino butyric 
acid*, L-α-amino isobutyric acid*, L-ε-amino caproic acid*, 7-amino heptanoic acid*, 
L-methionine sulfone#*, L-norleucine*, L-norvaline*, p-nitro-L-phenylalanine*, 
L-hydroxyproline*, L-thioproline*, methyl derivatives of phenylalanine (Phe) such as 
4-methyl-Phe*, pentamethyl-Phe*, L-Phe (4-amino)#, L-Tyr (methyl)*, L-Phe (4-isopropyl)*, L-Tic 
(1,2,3,4-tetrahydroisoquinoline-3-carboxyl acid)*, L-diaminopropionic acid# and L-Phe 
(4-benzyl)*. The notation * has been utilised for the purpose of the discussion above (relating to 
homologous or non-homologous substitution), to indicate the hydrophobic nature of the 
derivative whereas # has been utilised to indicate the hydrophilic nature of the derivative, #* 
indicates amphipathic characteristics. 

Variant amino acid sequences may include suitable spacer groups that may be inserted 
between any two amino acid residues of the sequence including alkyl groups such as methyl, 
ethyl or propyl groups in addition to amino acid spacers such as glycine or β-alanine residues. 
A further form of variation, involving the presence of one or more amino acid residues in 
peptoid form, will be well understood by those skilled in the art. For the avoidance of doubt, 
"the peptoid form" is used to refer to variant amino acid residues wherein the α-carbon 
substituent group is on the residue's nitrogen atom rather than the α-carbon. Processes for 
preparing peptides in the peptoid form are known in the art, for example Simon RJ et al, 
PNAS (1992) 89(20), 9367-9371 and Horwell DC, Trends Biotechnol. (1995) 13(4), 132-134. 

LIGAND-BINDING DOMAIN 

As used herein, the term "ligand binding domain (LBD)" refers to a region of a molecule or 
molecular complex that as a result of its shape, favourably associates with a ligand or a part 
thereof. For example, it may be a region of a mannosidase that is responsible for binding a 
substrate or modulator (e.g. swainsonine). With reference to the crystal of the present 
invention residues in the LBD may be defined by their spatial proximity to the ligand (for 
example swainsonine or substrate) in the crystal structure. 

"Ligand" refers to a compound or entity that associates with a ligand binding domain, 
including substrates or analogues or parts thereof, or modulators of a mannosidase including 
inhibitors. A ligand may be designed rationally by using a model according to the present 
invention. 

The term "ligand binding domain (LBD)" also includes a homologue of the ligand binding 
domain or a portion thereof. 

As used herein, the term "homologue" in reference to a ligand binding domain refers to a ligand 
binding domain or a portion thereof which may have deletions, insertions or substitutions of 
amino acid residues as long as the binding specificity of the molecule is retained. In this 
regard, deliberate amino acid substitutions may be made on the basis of similarity in polarity, 
charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 
residues as long as the binding specificity of the ligand binding domain is retained. 

As used herein, the term "portion thereof" means the structural coordinates corresponding to a 
sufficient number of amino acid residues of the mannosidase II LBD (or homologues thereof) 
that are capable of interacting with a test compound capable of binding to the LBD. This term 
includes mannosidase II ligand binding domain amino acid residues lying within 
about 4 Å to about 5 Å of a bound compound or fragment thereof. Thus, for 
example, the structural coordinates provided in the crystal structure may contain a subset of 
the amino acid residues in the LBD which may be useful in the modelling and design of 
compounds that bind to the LBD. 

A ligand binding domain may be defined by its association with a ligand. With reference to a 
crystal of the present invention, residues in the LBD may be defined by their spatial proximity 
to a ligand in the crystal structure. For example, such residues may be defined by their 
proximity to a substrate or modulator (e.g. swainsonine). 

The active site of a mannosidase II crystal of the invention may be characterized as follows: 
(a) a small cavity lined by aromatic residues Trp-95, Phe-206, Tyr-269 and Tyr-727; 
(b) a zinc ion binding site within the cavity characterized by a T5-square-based pyramidal 
geometry and 'elec-His-Zn motifs'. 

A binding domain for a GMII inhibitor such as swainsonine or DMNJ comprises one or 
more of Trp-95, Phe-206 and Tyr-727, which form a binding cavity for the inhibitor. The 
inhibitor ring structures can be stacked against Trp-95, and stabilized by hydrogen bonds and 
interactions with the zinc ion. When bound to an inhibitor the zinc ion binding domain of the 
GMII can be transformed into T6-octahedral coordination. The binding domain allows for the 
formation of a hydrogen bond between the zinc-coordinating OD1 oxygen of Asp-204 and the 
N4 nitrogen at the fusion of the five and six-membered rings of swainsonine. The 
zinc-coordinating oxygen atoms of the inhibitors are involved in hydrogen bond interactions with 
the neighboring metal binding residues of the enzyme. 

The position of the inhibitor molecules is stabilized in the active site by hydrogen bonds 
between carboxylic oxygens OD1 and OD2 of residue Asp-472 and hydroxyl oxygens O3 and 
O4 (O5 in swainsonine) of the inhibitors. DMNJ is involved in additional hydrogen bonds, via 
water molecules, with the NH2 nitrogen of Arg-228, the hydroxyl oxygen of Tyr-269, the 
backbone carbonyl oxygen of Arg-876, and the OD1 oxygen of Asp-204. 

In an embodiment, a ligand binding domain comprises one or more of the following amino 
acid residues: His 471, His 90, Asp 92, and Asp 204; or a homologue thereof. 

In a second embodiment, a ligand binding domain comprises one or more of the following 
amino acid residues: Trp-95, Phe-206, Tyr-269, and Tyr-727. 

In another embodiment, a ligand binding domain comprises one or more of the following 
amino acid residues: Asp-92, Asp-204, His-90, His-471. 

In still another embodiment, a ligand binding domain comprises one or more of the following 
amino acid residues: His 471, Asp 204, Asp 341, His 90, Asp 92, Asp 472, Phe 206, Tyr 727 
and Trp 95; or a homologue thereof. 

In yet another embodiment a ligand binding domain comprises one or more of the following 
groups: 

(a) GVWKQG (residues 60-65) 
(b) VFVVPHSHND (residues 83-92) 
(c) WAIDPFGH (residues 201-208) 
(d) HMMPFYSYDIPHTCGPDPK V /,CCQFDFKR (residues 262-289) 
(e) LL'/aPLGDDFR (residues 334-343). 

In an aspect of the invention, a ligand binding domain comprises one or more of the enzyme 
residues shown in Table 3 and/or Table 4. 

A crystal of a binding domain may be defined by selected atomic contacts. 

In an embodiment, the binding site of the mannosidase II inhibitor swainsonine is described in 
Table 3, and details of the atomic interactions of the binding site are set out in Table 4. In the 
swainsonine binding site there are direct hydrogen bonds between the inhibitor and the 
enzyme. Atomic contacts on the enzyme comprise Trp-95, Phe-206, Tyr-727, Asp-472, and 
Asp-204 (see Table 4, Figures 1 and 5). 

In a particular embodiment of the invention, a secondary or three-dimensional structure of a 
binding domain of a mannosidase II that associates with an inhibitor of a mannosidase II is 
provided comprising at least two or three atomic contacts of the atomic interactions in Table 
4, each atomic interaction defined therein by an atomic contact (more preferably, a specific 
atom where indicated) on the inhibitor, and an atomic contact (more preferably, a specific 
amino acid residue where indicated) on the mannosidase II (i.e. enzyme atomic contact). 
Preferably, the binding domain is defined by the atoms of the enzyme atomic contacts having 
the structural coordinates for the atoms listed in Table 1, 2, or 8. 

METHOD OF MAKING A CRYSTAL 



The present invention also provides a method of making a crystal according to the invention. 
The crystal may be formed from an aqueous solution comprising a purified polypeptide 
comprising a mannosidase II or part or fragment thereof (e.g. a catalytic portion, ligand 
binding domain). A method may utilize a purified polypeptide comprising a mannosidase II 
ligand binding domain to form a crystal. 

The term "purified" in reference to a polypeptide does not require absolute purity such as a 
homogeneous preparation; rather, it indicates that the polypeptide is relatively 
purer than in its natural environment. Generally, a purified polypeptide is substantially free of 
other proteins, lipids, carbohydrates, or other materials with which it is naturally associated, 
preferably at a functionally significant level for example at least 85% pure, more preferably at 
least 95% pure, most preferably at least 99% pure. A skilled artisan can purify a polypeptide 
comprising a mannosidase II using standard techniques for protein purification. A 
substantially pure polypeptide comprising a mannosidase II will yield a single major band on 
a non-reducing polyacrylamide gel. The purity of the mannosidase II can also be determined 
by amino-terminal amino acid sequence analysis. 

A polypeptide used in the method may be chemically synthesized in whole or in part using 
techniques that are well-known in the art. Alternatively, methods are well known to the 
skilled artisan to construct expression vectors containing the native or mutated mannosidase II 
coding sequence and appropriate transcriptional/translational control signals. These methods 
include in vitro recombinant DNA techniques, synthetic techniques, and in vivo 
recombination/genetic recombination. See for example the techniques described in Sambrook 
et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory 
Press (1989)), and other laboratory textbooks. (See also Sarker et al, Glycoconjugate J. 7:380, 
1990; Sarker et al, Proc. Natl. Acad. Sci. USA 88:234-238, 1991; Sarker et al, Glycoconjugate 
J. 11: 204-209, 1994; Hull et al, Biochem Biophys Res Commun 176:608, 1991 and Pownall 
et al, Genomics 12:699-704, 1992). 

Crystals may be grown from an aqueous solution containing the purified mannosidase II 
polypeptide by a variety of conventional processes. These processes include batch, liquid 
bridge, dialysis, vapor diffusion, and hanging drop methods. (See for example, McPherson, 
1982 John Wiley, New York; McPherson, 1990, Eur. J. Biochem. 189: 1-23; Weber, 1991, 
Adv. Protein Chem. 41:1-36). Generally, the native crystals of the invention are grown by 
adding precipitants to the concentrated solution of the mannosidase II polypeptide. The 
precipitants are added at a concentration just below that necessary to precipitate the protein. 
Water is removed by controlled evaporation to produce precipitating conditions, which are 
maintained until crystal growth ceases. 

Derivative crystals of the invention can be obtained by soaking native crystals in a solution 
containing salts of heavy metal atoms. A complex of the invention can be obtained by soaking 
a native crystal in a solution containing a compound that binds the polypeptide, or by 
co-crystallizing the polypeptide in the presence of one or more compounds. In 
order to obtain co-crystals with a compound which binds deep within the tertiary structure of 
the polypeptide it is necessary to use the second method. 

Once the crystal is grown it can be placed in a glass capillary tube and mounted onto a 
holding device connected to an X-ray generator and an X-ray detection device. Collection of 
X-ray diffraction patterns is well documented by those skilled in the art (See for example, 
Ducruix and Giegé, 1992, IRL Press, Oxford, England). A beam of X-rays enters the crystal 
and diffracts from the crystal. An X-ray detection device can be utilized to record the 
diffraction patterns emanating from the crystal. Suitable devices include the Mar 345 
imaging plate detector system with an RU200 rotating anode generator. 

Methods for obtaining the three dimensional structure of the crystalline form of a molecule or 
complex are described herein and known to those skilled in the art (see Ducruix and Giegé 
1992, IRL Press, Oxford, England). Generally, the X-ray crystal structure is given by the 
diffraction patterns. Each diffraction pattern reflection is characterized as a vector and the 
data collected at this stage determine the amplitude of each vector. The phases of the vectors 
may be determined by the isomorphous replacement method where heavy atoms soaked into 
the crystal are used as reference points in the X-ray analysis (see for example, Otwinowski, 
1991, Daresbury, United Kingdom, 80-86). The phases of the vectors may also be determined 
by molecular replacement (see for example, Navaza, 1994, Proteins 11:281-296). The 
amplitudes and phases of vectors from the crystalline form of a mannosidase II determined in 
accordance with these methods can be used to analyze other related crystalline polypeptides. 

The unit cell dimensions and symmetry, and vector amplitude and phase information can be 
used in a Fourier transform function to calculate the electron density in the unit cell i.e. to 
generate an experimental electron density map. This may be accomplished using the PHASES 
package (Furey, 1990). Amino acid sequence structures are fit to the experimental electron 
density map (i.e. model building) using computer programs (e.g. Jones, TA. et al, Acta 
Crystallogr A47, 100-119, 1991). This structure can also be used to calculate a theoretical 
electron density map. The theoretical and experimental electron density maps can be 
compared and the agreement between the maps can be described by a parameter referred to as 
R-factor. A high degree of overlap in the maps is represented by a low value R-factor. The 
R-factor can be minimized by using computer programs that refine the structure to achieve 
agreement between the theoretical and observed electron density maps. For example, the 
XPLOR program, developed by Brunger (1992, Nature 355:472-475) can be used for model 
refinement. 
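
As a hedged illustration of the R-factor, the Python sketch below uses the conventional crystallographic definition, namely the sum of absolute differences between observed and calculated structure factor amplitudes divided by the sum of observed amplitudes. This formula is an assumption brought in for the example, since the text above describes the R-factor only in terms of agreement between maps. The amplitudes are hypothetical and no scale factor is applied.

```python
def r_factor(f_obs, f_calc):
    """Conventional R-factor: sum(| |Fobs| - |Fcalc| |) / sum(|Fobs|).
    Both inputs are equally long sequences of amplitudes, one per
    reflection, assumed to be on the same scale."""
    if len(f_obs) != len(f_calc) or not f_obs:
        raise ValueError("amplitude lists must be non-empty and of equal length")
    denominator = sum(abs(fo) for fo in f_obs)
    if denominator == 0:
        raise ValueError("observed amplitudes sum to zero")
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    return numerator / denominator


# Hypothetical amplitudes for four reflections, for illustration only.
observed = [100.0, 50.0, 25.0, 25.0]
calculated = [90.0, 55.0, 30.0, 25.0]

r_value = r_factor(observed, calculated)
```

A value of zero corresponds to perfect agreement, and refinement programs adjust the model so as to lower this value.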

A three dimensional structure of a molecule or complex may be described by atoms that fit 
the theoretical electron density characterized by a minimum R value. Files can be created for 
the structure that define each atom by coordinates in three dimensions. 

MODEL 

A crystal structure of the present invention may be used to make a model of the mannosidase 
II or a part thereof (e.g. a ligand-binding domain). A model may, for example, be a structural 
model (or a representation thereof), or a computer model. A model may represent the 
secondary, tertiary and/or quaternary structure of the mannosidase II. The model itself may 
be in two or three dimensions. It is possible for a computer model to be in three dimensions 
despite the constraints imposed by a conventional computer screen, if it is possible to scroll 
along at least a pair of axes, causing "rotation" of the image. 

Thus, for example, the structural coordinates provided in the crystal structure and/or model 
structure may comprise the amino acid residues of the mannosidase II LBD, or a portion of 
the mannosidase II LBD or a homologue thereof useful in the modelling and design of test 
compounds capable of binding to the mannosidase II LBD. 

As used herein, the term "modelling" includes the quantitative and qualitative analysis of 
molecular structure and/or function based on atomic structural information and interaction 
models. The term "modelling" includes conventional numeric-based molecular dynamic and 
energy minimization models, interactive computer graphic models, modified molecular 
mechanics models, distance geometry and other structure-based constraint models. 

Preferably, modelling is performed using a computer and may be further optimized using 
known methods. This is called modelling optimisation. 

Overlays and superpositioning with a three dimensional model of the mannosidase II LBD, 
and/or a portion thereof, can also be used for modelling optimisation. Additionally, alignment 
and/or modelling can be used as a guide for the placement of mutations on the mannosidase II 
LBD surface to characterise the nature of the site in the context of a cell. 

The three dimensional structure of a new crystal may be modelled using molecular 
replacement. The term "molecular replacement" refers to a method that involves generating a 
preliminary model of a molecule or complex whose structural coordinates are unknown, by 
orienting and positioning a molecule whose structural coordinates are known within the unit 
cell of the unknown crystal, so as best to account for the observed diffraction pattern of the 
unknown crystal. Phases can then be calculated from this model and combined with the 
observed amplitudes to give an approximate Fourier synthesis of the structure whose 
coordinates are unknown. This, in turn, can be subject to any of the several forms of 
refinement to provide a final, accurate structure of the unknown crystal (see Lattman, E., "Use of 
the Rotation and Translation Functions", in Methods in Enzymology, 115, pp. 55-77 (1985); 
M. G. Rossmann, ed., "The Molecular Replacement Method", Int. Sci. Rev. Ser., No. 13, 
Gordon & Breach, New York, (1972)). 



Commonly used computer software packages for molecular replacement are X-PLOR 
(Brunger 1992, Nature 355: 472-475), AMoRE (Navaza, 1994, Acta Crystallogr. A50:157-163), 
the CCP4 package (Collaborative Computational Project, Number 4, "The CCP4 Suite: 
Programs for Protein Crystallography", Acta Cryst., Vol. D50, pp. 760-763, 1994), the 
MERLOT package (P.M.D. Fitzgerald, J. Appl. Cryst., Vol. 21, pp. 273-278, 1988) and 
XTALVIEW (McRee et al (1992) J. Mol. Graphics 10: 44-46). It is preferable that the 
resulting structure not exhibit a root-mean-square deviation of more than 3 Å. 

The quality of the model may be analysed using a program such as PROCHECK or 3D-Profiler 
[Laskowski et al 1993 J. Appl. Cryst. 26:283-291; Luthy R. et al, Nature 356: 83-85, 
1992; and Bowie, J.U. et al, Science 253: 164-170, 1991]. Once any irregularities have been 
resolved, the entire structure may be further refined. 

Other molecular modelling techniques may also be employed in accordance with this 
invention. See, e.g., Cohen, N. C. et al, "Molecular Modelling Software and Methods for 
Medicinal Chemistry", J. Med. Chem., 33, pp. 883-894 (1990). See also, Navia, M. A. and M. 
A. Murcko, "The Use of Structural Information in Drug Design", Current Opinions in 
Structural Biology, 2, pp. 202-210 (1992). 

Using the structural coordinates of the crystal complexes provided by this invention, 
molecular modelling may be used to determine the structural coordinates of a crystalline 
mutant or homologue of mannosidase II LBD or of a related protein. By the same token, a 
crystal of the second aspect of the invention can be used to provide a model of swainsonine. 
Modelling techniques can then be used to approximate the three dimensional structure of 
swainsonine derivatives and other compounds which may be able to mimic the atomic 
contacts between swainsonine and the LBD. 

COMPUTER FORMAT OF CRYSTALS/MODELS 

Information derivable from the crystal of the present invention (for example the structural 
coordinates) and/or a model of the present invention may be provided in a computer-readable 
format. 

Therefore, the invention provides a computer readable medium or a machine readable storage
medium which comprises the structural coordinates of a mannosidase II including all or any
parts of the mannosidase II (e.g. the ligand-binding domain), ligands including portions thereof,
or substrates including portions thereof. Such a storage medium, or a storage medium encoded
with these data, is capable of displaying on a computer screen or similar viewing device a
three-dimensional graphical representation of a molecule or molecular complex which comprises
the enzyme or ligand binding domains or similarly shaped homologous enzymes or ligand
binding domains. Thus, the invention also provides computerized representations of a crystal
of the invention, including any electronic, magnetic, or electromagnetic storage forms of the
data needed to define the structures such that the data will be computer readable for purposes
of display and/or manipulation.
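
By way of illustration only, the following minimal Python sketch shows how structural coordinates held in a computer-readable form might be read into memory for display or manipulation. It assumes, purely for the purpose of the example, that the coordinates are stored in the fixed-column PDB ATOM/HETATM layout; the names `Atom` and `parse_atoms` are hypothetical, and the two records shown are made-up values, not coordinates taken from Table 1, 2, or 8.

```python
from typing import List, NamedTuple


class Atom(NamedTuple):
    name: str      # atom name, e.g. "CA"
    res_name: str  # residue name, e.g. "ASP"
    chain: str     # chain identifier
    res_seq: int   # residue sequence number
    x: float       # orthogonal coordinates in angstroms
    y: float
    z: float


def parse_atoms(text: str) -> List[Atom]:
    """Read ATOM/HETATM records laid out in the fixed PDB columns (assumed format)."""
    atoms = []
    for line in text.splitlines():
        if line.startswith(("ATOM", "HETATM")):
            atoms.append(Atom(
                name=line[12:16].strip(),
                res_name=line[17:20].strip(),
                chain=line[21],
                res_seq=int(line[22:26]),
                x=float(line[30:38]),
                y=float(line[38:46]),
                z=float(line[46:54]),
            ))
    return atoms


# Made-up example records for illustration; not coordinates from the Tables.
EXAMPLE = (
    "ATOM      1  N   ASP A  92      11.104  13.207   2.100  1.00 20.00\n"
    "ATOM      2  CA  ASP A  92      12.560  13.207   2.100  1.00 20.00\n"
)
```

A list of such records is one possible in-memory form of the machine readable data from which a three-dimensional representation could be drawn.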

In an aspect the invention provides a computer for producing a three-dimensional 
representation of a molecule or molecular complex, wherein said molecule or molecular 
complex comprises a mannosidase II or ligand binding domain thereof defined by structural 
coordinates of mannosidase II amino acids or a ligand binding domain thereof, or comprises 
structural coordinates of atoms of a ligand or substrate, or a three-dimensional representation 
of a homologue of said molecule or molecular complex, wherein said computer comprises: 

(a) a machine-readable data storage medium comprising a data storage material encoded
with machine readable data wherein said data comprises the structural coordinates of
mannosidase II amino acids according to Table 1, 2, or 8 or a ligand binding domain
thereof, or a ligand (e.g. swainsonine) according to Table 2 or Table 8;

(b) a working memory for storing instructions for processing said machine-readable data;

(c) a central-processing unit coupled to said working memory and to said
machine-readable data storage medium for processing said machine readable data into said
three-dimensional representation; and

(d) a display coupled to said central-processing unit for displaying said three-dimensional
representation.

A homologue may comprise a mannosidase II or ligand binding domain thereof, or ligand or
substrate, that has a root mean square deviation from the backbone atoms of the corresponding
reference structure of not more than 1.5 angstroms.
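
The root mean square deviation criterion can be illustrated with a short Python sketch. It is a simplified example under stated assumptions: the two sets of backbone atoms are taken to be already superimposed and listed in the same order, no optimal superposition is performed, and the function names `backbone_rmsd` and `is_homologue` are hypothetical.

```python
import math
from typing import Sequence, Tuple

Coord = Tuple[float, float, float]


def backbone_rmsd(ref: Sequence[Coord], other: Sequence[Coord]) -> float:
    """RMSD in angstroms between paired atoms that are already superimposed."""
    if len(ref) != len(other) or not ref:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    # Sum of squared distances over all atom pairs.
    total = sum(
        (a - b) ** 2
        for p, q in zip(ref, other)
        for a, b in zip(p, q)
    )
    return math.sqrt(total / len(ref))


def is_homologue(ref: Sequence[Coord], other: Sequence[Coord],
                 cutoff: float = 1.5) -> bool:
    """Apply the backbone RMSD cutoff of 1.5 angstroms mentioned in the text."""
    return backbone_rmsd(ref, other) <= cutoff
```

In practice a least-squares superposition would normally precede the RMSD calculation; that step is omitted here for brevity.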


The invention also provides a computer for determining at least a portion of the structural 
coordinates corresponding to an X-ray diffraction pattern of a molecule or molecular complex 
wherein said computer comprises: 

(a) a machine-readable data storage medium comprising a data storage material
encoded with machine readable data wherein said data comprises the structural
coordinates according to Table 1, 2, or 8;

(b) a machine-readable data storage medium comprising a data storage material
encoded with machine readable data wherein said data comprises an X-ray
diffraction pattern of said molecule or molecular complex;

(c) a working memory for storing instructions for processing said machine-readable
data of (a) and (b);

(d) a central-processing unit coupled to said working memory and to said
machine-readable data storage medium of (a) and (b) for performing a Fourier transform of
the machine readable data of (a) and for processing said machine readable data of
(b) into structural coordinates; and

(e) a display coupled to said central-processing unit for displaying said structural
coordinates of said molecule or molecular complex.

STRUCTURAL DETERMINATIONS



The present invention also provides a method for determining the secondary and/or tertiary
structures of a polypeptide by using a crystal, or a model according to the present invention.
The polypeptide may be any polypeptide for which the secondary and/or tertiary structure is
uncharacterised or incompletely characterised. In a preferred embodiment the polypeptide
shares (or is predicted to share) some structural or functional homology to the mannosidase II
crystal. For example, the polypeptide may show a degree of structural homology over some
or all parts of the primary amino acid sequence. For example, the polypeptide may have one or
more domains which show homology with a mannosidase II domain (Kapitonov and Yu
(1999) Glycobiology 9(10): 961-978).

The polypeptide may be a mannosidase II with a different specificity for a ligand or substrate. 
The polypeptide may be a mannosidase II which requires a different metal cofactor. 
Alternatively (or in addition) the polypeptide may be a mannosidase II from a different 
species. 

The polypeptide may be a mutant of the wild-type mannosidase II. A mutant may arise 
naturally, or may be made artificially (for example using molecular biology techniques). The 
mutant may also not be "made" at all in the conventional sense, but merely tested 
theoretically using the model of the present invention. A mutant may or may not be 
functional. 

Thus, using the model of the present invention, the effect of a particular mutation on the 
overall two and/or three dimensional structure of a mannosidase II and/or the interaction 
between the enzyme and a ligand or substrate can be investigated. Alternatively, the 
polypeptide may perform an analogous function or be suspected to show a similar catalytic 
mechanism to the mannosidase II enzyme. For example the polypeptide may remove, 
transport, or add on a sugar residue.

The polypeptide may also be the same as the polypeptide of the crystal, but in association
with a different ligand (for example, modulator or inhibitor) or cofactor. In this way it is
possible to investigate the effect of altering a ligand or compound with which the polypeptide
is associated on the structure of the LBD.

Secondary or tertiary structure may be determined by applying the structural coordinates of
the crystal or model of the present invention to other data such as an amino acid sequence,
X-ray crystallographic diffraction data, or nuclear magnetic resonance (NMR) data. Homology
modeling, molecular replacement, and nuclear magnetic resonance methods using these other
data sets are described below.

Homology modeling (also known as comparative modeling or knowledge-based modeling)
methods develop a three dimensional model from a polypeptide sequence based on the
structures of known proteins (i.e. mannosidase II of the crystal). The method utilizes a
computer model of the crystal of the present invention (the "known structure"), a computer
representation of the amino acid sequence of the polypeptide with an unknown structure, and
standard computer representations of the structures of amino acids. The method in particular
comprises the steps of: (a) identifying structurally conserved and variable regions in the
known structure; (b) aligning the amino acid sequences of the known structure and unknown
structure; (c) generating coordinates of main chain atoms and side chain atoms in structurally
conserved and variable regions of the unknown structure based on the coordinates of the
known structure thereby obtaining a homology model; and (d) refining the homology model
to obtain a three dimensional structure for the unknown structure. This method is well known
to those skilled in the art (Greer, 1985, Science 228, 1055; Blundell et al 1988, Eur. J.
Biochem. 172, 513; Knighton et al., 1992, Science 258:130-135;
http://biochem.vt.edu/courses/modeling/homology.htn). Computer programs that can be used
in homology modeling are Quanta and the Homology module in the Insight II modelling
package distributed by Molecular Simulations Inc, or MODELLER (Rockefeller University,
www.iucr.ac.uk/sinris-top/logical/prg-modeller.html).

In step (a) of the homology modeling method, the known mannosidase II structure is
examined to identify the structurally conserved regions (SCRs) from which an average
structure, or framework, can be constructed for these regions of the protein. Variable regions
(VRs), in which known structures may differ in conformation, also must be identified. SCRs
generally correspond to the elements of secondary structure, such as alpha-helices and
beta-sheets, and to ligand- and substrate-binding sites (e.g. acceptor and donor binding sites). The
VRs usually lie on the surface of the proteins and form the loops where the main chain turns.

Many methods are available for sequence alignment of known structures and unknown
structures. Sequence alignments generally are based on the dynamic programming algorithm
of Needleman and Wunsch [J. Mol. Biol. 48: 443-453, 1970]. Current methods include
FASTA, Smith-Waterman, and BLASTP, with the BLASTP method differing from the other
two in not allowing gaps. Scoring of alignments typically involves construction of a 20x20
matrix in which identical amino acids and those of similar character (i.e., conservative
substitutions) may be scored higher than those of different character. Substitution schemes
which may be used to score alignments include the scoring matrices PAM (Dayhoff et al,
Meth. Enzymol. 91: 524-545, 1983), and BLOSUM (Henikoff and Henikoff, Proc. Nat. Acad.
Sci. USA 89: 10915-10919, 1992), and the matrices based on alignments derived from
three-dimensional structures including that of Johnson and Overington (JO matrices) (J. Mol. Biol.
233:716-738, 1993).
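
The dynamic programming approach to global alignment can be sketched in a few lines of Python. This is a minimal illustration rather than the method of any particular package: it uses a simple match/mismatch score and a linear gap penalty (all three values are assumptions chosen for the example), whereas a practical alignment would score residue pairs with a 20x20 substitution matrix such as PAM or BLOSUM. The function name `needleman_wunsch` is hypothetical.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Return (score, aligned_a, aligned_b) for one optimal global alignment."""
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,  # align the two residues
                              score[i - 1][j] + gap,      # gap in b
                              score[i][j - 1] + gap)      # gap in a
    # Trace back from the bottom-right corner to recover one optimal alignment.
    top, bottom = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                top.append(a[i - 1])
                bottom.append(b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            top.append(a[i - 1])
            bottom.append("-")
            i -= 1
        else:
            top.append("-")
            bottom.append(b[j - 1])
            j -= 1
    return score[n][m], "".join(reversed(top)), "".join(reversed(bottom))
```

The two returned strings have equal length, with "-" marking a gap; such a pairing of residues is the input needed for step (c) of the homology modeling method.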

Alignment based solely on sequence may be used; however, other structural features also may
be taken into account. In Quanta, multiple sequence alignment algorithms are available that
may be used when aligning a sequence of the unknown with the known structures. Four
scoring systems (i.e. sequence homology, secondary structure homology, residue accessibility
homology, CA-CA distance homology) are available, each of which may be evaluated during
an alignment so that relative statistical weights may be assigned.

When generating coordinates for the unknown structure, main chain atoms and side chain
atoms, both in SCRs and VRs, need to be modeled. A variety of approaches known to those
skilled in the art may be used to assign coordinates to the unknown. In particular, the
coordinates of the main chain atoms of SCRs will be transferred to the unknown structure.
VRs correspond most often to the loops on the surface of the polypeptide and if a loop in the
known structure is a good model for the unknown, then the main chain coordinates of the
known structure may be copied. Side chain coordinates of SCRs and VRs are copied if the
residue type in the unknown is identical to or very similar to that in the known structure. For
other side chain coordinates, a side chain rotamer library may be used to define the side chain
coordinates. When a good model for a loop cannot be found, fragment databases may be
searched for loops in other proteins that may provide a suitable model for the unknown. If
desired, the loop may then be subjected to conformational searching to identify low energy
conformers.
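
The transfer of coordinates from the known structure to the unknown can be illustrated as follows. The sketch is deliberately simplified and rests on assumptions: each residue of the known structure is represented by a single coordinate (for example its alpha carbon), the two sequences are supplied as an already gapped alignment, and residues of the unknown that face a gap are returned with `None` to mark positions that would still need loop modeling. The function name `transfer_backbone` is hypothetical.

```python
def transfer_backbone(aligned_known, aligned_unknown, known_coords):
    """Copy coordinates of the known structure onto aligned residues of the unknown.

    aligned_known / aligned_unknown: gapped sequences of equal length ("-" = gap).
    known_coords: one (x, y, z) tuple per residue of the known structure.
    Returns a list of (residue, coordinate or None) for the unknown structure.
    """
    if len(aligned_known) != len(aligned_unknown):
        raise ValueError("aligned sequences must have equal length")
    model = []
    k = 0  # index of the current residue in the known structure
    for res_known, res_unknown in zip(aligned_known, aligned_unknown):
        coord = None
        if res_known != "-":
            if res_unknown != "-":
                coord = known_coords[k]  # aligned pair: copy the coordinate
            k += 1
        if res_unknown != "-":
            model.append((res_unknown, coord))
    return model
```

Positions returned with `None` correspond to insertions in the unknown, for which a fragment database or conformational search would be consulted as described above.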

Once a homology model has been generated it is analyzed to determine its correctness. A
computer program available to assist in this analysis is the Protein Health module in Quanta
which provides a variety of tests. Other programs that provide structure analysis along with
output include PROCHECK and 3D-Profiler [Luthy R. et al, Nature 356: 83-85, 1992; and
Bowie, J.U. et al, Science 253: 164-170, 1991]. Once any irregularities have been resolved,
the entire structure may be further refined. Refinement may consist of energy minimization
with restraints, especially for the SCRs. Restraints may be gradually removed for subsequent
minimizations. Molecular dynamics may also be applied in conjunction with energy
minimization.

Molecular replacement involves applying a known structure to solve the X-ray
crystallographic data set of a polypeptide of unknown structure. The method can be used to
define the phases describing the X-ray diffraction data of a polypeptide of unknown structure
when only the amplitudes are known. Thus in an embodiment of the invention, a method is
provided for determining three dimensional structures of polypeptides with unknown structure
by (a) applying the structural coordinates of the crystal of the present invention to provide an
X-ray crystallographic data set for a polypeptide of unknown structure, and (b) determining a
low energy conformation of the resulting structure.

Molecular replacement computer programs generally involve the following steps: (1)
determining the number of molecules in the unit cell and defining the angles between them
(self rotation function); (2) rotating the known structure against diffraction data to define the
orientation of the molecules in the unit cell (rotation function); (3) translating the known
structure in three dimensions to correctly position the molecules in the unit cell (translation
function); (4) determining the phases of the X-ray diffraction data and calculating an R-factor
from the reference data set and from the new data, wherein an R-factor between 30%
and 50% indicates that the orientations of the atoms in the unit cell have been reasonably
determined by the method; and (5) optionally, decreasing the R-factor to about 20% by
refining the new electron density map using iterative refinement techniques known to those
skilled in the art (refinement).
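
The R-factor referred to in step (4) compares observed structure factor amplitudes with amplitudes calculated from a positioned model. The Python sketch below illustrates the two calculations in their simplest textbook form. It makes several simplifying assumptions that are not taken from the text: atoms are given in fractional coordinates with a constant scattering factor, no temperature factors, symmetry operators, or scale factor are applied, and the function names `structure_factor` and `r_factor` are hypothetical.

```python
import cmath
import math


def structure_factor(hkl, atoms):
    """F(hkl) = sum over atoms of f * exp(2*pi*i*(h*x + k*y + l*z)).

    atoms: iterable of (f, x, y, z) with x, y, z as fractional coordinates.
    """
    h, k, l = hkl
    return sum(
        f * cmath.exp(2j * math.pi * (h * x + k * y + l * z))
        for f, x, y, z in atoms
    )


def r_factor(f_obs, f_calc):
    """R = sum(| |Fobs| - |Fcalc| |) / sum(|Fobs|) over matching reflections."""
    if len(f_obs) != len(f_calc) or not f_obs:
        raise ValueError("amplitude lists must be non-empty and equal in length")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator
```

Multiplying the returned value by 100 gives the percentage form used in the text, so that a value of 0.30 to 0.50 corresponds to the 30% to 50% range mentioned for a reasonably positioned model.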

In an embodiment of the invention, a method is provided for determining three dimensional
structures of polypeptides with unknown structure (e.g. additional native or mutated
mannosidase II enzymes) by (a) applying the structural coordinates of a mannosidase II structure
to provide an X-ray crystallographic data set for a polypeptide of unknown structure, and (b)
determining a low energy conformation of the resulting structure.

The structural coordinates of the crystal of the present invention may be applied to nuclear
magnetic resonance (NMR) data to determine the three dimensional structures of polypeptides
with uncharacterised or incompletely characterised structure (see, for example, Wuthrich,
1986, John Wiley and Sons, New York: 176-199; Pflugrath et al., 1986, J. Molecular Biology
189: 383-386; Kline et al., 1986 J. Molecular Biology 189:377-382). While the secondary
structure of a polypeptide may often be determined by NMR data, the spatial connections
between individual pieces of secondary structure are not as readily determined. The structural
coordinates of a polypeptide defined by X-ray crystallography can guide the NMR
spectroscopist to an understanding of the spatial interactions between secondary structural
elements in a polypeptide of related structure. Information on spatial interactions between
secondary structural elements can greatly simplify the interpretation of Nuclear Overhauser
Effect (NOE) data from two-dimensional NMR experiments. In addition, applying the structural
coordinates after the determination of secondary structure by NMR techniques simplifies the
assignment of NOEs relating to particular amino acids in the polypeptide sequence and does
not greatly bias the NMR analysis of polypeptide structure.

In an embodiment, the invention relates to a method of determining three dimensional
structures of polypeptides with unknown structures, by applying the structural coordinates of
a crystal of the present invention to nuclear magnetic resonance (NMR) data of the unknown
structure. This method comprises the steps of: (a) determining the secondary structure of an
unknown structure using NMR data; and (b) simplifying the assignment of through-space
interactions of amino acids. The term "through-space interactions" defines the orientation of
the secondary structural elements in the three dimensional structure and the distances between
amino acids from different portions of the amino acid sequence. The term "assignment"
defines a method of analyzing NMR data and identifying which amino acids give rise to
signals in the NMR spectrum.
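
One way in which crystal coordinates might guide the assignment of through-space interactions is to list the residue pairs that lie close together in the known structure, since these are the pairs between which NOE signals would be expected in a related polypeptide. The Python sketch below is an illustration only. The 5 angstrom distance cutoff and the minimum sequence separation are assumptions chosen for the example, each residue is represented by a single proton position, and the function name `predicted_contacts` is hypothetical.

```python
import math
from itertools import combinations


def predicted_contacts(protons, cutoff=5.0, min_separation=2):
    """List residue pairs whose representative protons lie within `cutoff` angstroms.

    protons: list of (residue_number, (x, y, z)).
    min_separation: ignore pairs closer than this in the amino acid sequence,
    so that only contacts between different portions of the sequence remain.
    """
    contacts = []
    for (res_i, pos_i), (res_j, pos_j) in combinations(protons, 2):
        far_in_sequence = abs(res_i - res_j) >= min_separation
        if far_in_sequence and math.dist(pos_i, pos_j) <= cutoff:
            contacts.append((res_i, res_j))
    return contacts
```

The resulting list could be compared with observed cross peaks to narrow down which amino acids give rise to a given signal.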

SCREENING METHOD 

The present invention also provides a method of screening for a ligand that associates with a
ligand binding domain and/or modulates the function of mannosidase II, by using a crystal or
a model according to the present invention. The method may involve investigating whether a
test compound is capable of associating with or binding a ligand binding domain.

In accordance with an aspect of the present invention, a method is provided for screening for
a ligand capable of binding to a ligand binding domain, wherein said method comprises the
use of a crystal or model according to the invention.

In another aspect, the invention relates to a method of screening for a ligand capable of
binding to a ligand binding domain, wherein the ligand binding domain is defined by the
amino acid residue structural coordinates given herein, the method comprising contacting the
ligand binding domain with a test compound and determining if said test compound binds to
said ligand binding domain.

In one embodiment, the present invention provides a method of screening for a test compound 
capable of interacting with a key amino acid residue of the ligand binding domain of 
mannosidase II. 

Another aspect of the invention provides a process comprising the steps of: 

(a) performing the method of screening for a ligand as described above; 

(b) identifying one or more ligands capable of binding to a ligand binding domain; and 

(c) preparing a quantity of said one or more ligands. 

A further aspect of the invention provides a process comprising the steps of: 

(a) performing the method of screening for a ligand as described above; 

(b) identifying one or more ligands capable of binding to a ligand binding domain; and 

(c) preparing a pharmaceutical composition comprising said one or more ligands. 

Once a test compound capable of interacting with a key amino acid residue in a mannosidase
II LBD has been identified, further steps may be carried out to select compounds and/or to
modify existing compounds, so as to modulate the interaction with the key
amino acid residues in the mannosidase II LBD.

Yet another aspect of the invention provides a process comprising the steps of: 

(a) performing the method of screening for a ligand as described above; 

(b) identifying one or more ligands capable of binding to a ligand binding domain; 

(c) modifying said one or more ligands capable of binding to a ligand binding 
domain; 

(d) performing said method of screening for a ligand as described above; and

(e) optionally preparing a pharmaceutical composition comprising said one or
more ligands.

As used herein, the term "test compound" means any compound which is potentially capable
of associating with a ligand binding domain. If, after testing, it is determined that the test
compound does bind to the LBD, it is known as a "ligand".

A "test compound" includes, but is not limited to, a compound which may be obtainable from
or produced by any suitable source, whether natural or not. The test compound may be
designed or obtained from a library of compounds which may comprise peptides, as well as
other compounds, such as small organic molecules and particularly new lead compounds. By
way of example, the test compound may be a natural substance, a biological macromolecule,
or an extract made from biological materials such as bacteria, fungi, or animal (particularly
mammalian) cells or tissues, an organic or an inorganic molecule, a synthetic test compound,
a semi-synthetic test compound, a carbohydrate, a monosaccharide, an oligosaccharide or
polysaccharide, a glycolipid, a glycopeptide, a saponin, a heterocyclic compound, a structural
or functional mimetic, a peptide, a peptidomimetic, a derivatised test compound, a peptide
cleaved from a whole protein, or a peptide produced synthetically (such as, by way of
example, either using a peptide synthesizer or by recombinant techniques or combinations
thereof), a recombinant test compound, a natural or a non-natural test compound, a fusion
protein or equivalent thereof and mutants, derivatives or combinations thereof.

The test compound may be screened as part of a library or a data base of molecules. Data
bases which may be used include ACD (Molecular Designs Limited), NCI (National Cancer
Institute), CCDC (Cambridge Crystallographic Data Center), CAST (Chemical Abstracts
Service), Derwent (Derwent Information Limited), Maybridge (Maybridge Chemical
Company Ltd), Aldrich (Aldrich Chemical Company), DOCK (University of California in
San Francisco), and the Directory of Natural Products (Chapman & Hall). Computer
programs such as CONCORD (Tripos Associates) or DB-Converter (Molecular Simulations
Limited) can be used to convert a data set represented in two dimensions to one represented in
three dimensions.

Test compounds may be tested for their capacity to fit spatially into a mannosidase II LBD.
As used herein, the term "fits spatially" means that the three-dimensional structure of the test
compound is accommodated geometrically in a cavity or pocket of the mannosidase II LBD.
The test compound can then be considered to be a ligand.

A favourable geometric fit occurs when the surface area of the test compound is in close
proximity with the surface area of the cavity or pocket without forming unfavourable
interactions. A favourable complementary interaction occurs where the test compound
interacts by hydrophobic, aromatic, ionic, dipolar, or hydrogen donating and accepting forces.
Unfavourable interactions may be steric hindrance between atoms in the test compound and
atoms in the binding site.
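
A crude measure of geometric fit can be obtained by counting, for a given placement of the test compound, how many compound/pocket atom pairs are in close proximity and how many are so close as to indicate steric hindrance. The following Python sketch illustrates the idea. The two distance thresholds are assumptions chosen for the example rather than values from the text, no distinction is made between atom types, and the function name `fit_summary` is hypothetical.

```python
import math


def fit_summary(compound, pocket, clash=2.5, contact=4.5):
    """Return (contacts, clashes) for one placement of a compound in a pocket.

    compound, pocket: lists of (x, y, z) atom positions in angstroms.
    A pair closer than `clash` counts as steric hindrance; a pair between
    `clash` and `contact` counts as a favourable close contact.
    """
    contacts = 0
    clashes = 0
    for atom in compound:
        for site_atom in pocket:
            distance = math.dist(atom, site_atom)
            if distance < clash:
                clashes += 1
            elif distance <= contact:
                contacts += 1
    return contacts, clashes
```

A placement with many contacts and no clashes would be regarded as a better geometric fit than one with few contacts or with clashes; complementary interactions such as hydrogen bonding would need to be scored separately.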

If a model of the present invention is a computer model, the test compounds may be
positioned in an LBD through computational docking. If, on the other hand, the model of the
present invention is a structural model, the test compounds may be positioned in the LBD by,
for example, manual docking.

As used herein the term "docking" refers to a process of placing a compound in close
proximity with a mannosidase II LBD, or a process of finding low energy conformations of a
test compound/mannosidase II complex.



A screening method of the present invention may comprise the following steps:

(i) generating a computer model of a mannosidase II or a selected site thereof
using a crystal according to the first aspect of the invention;

(ii) docking a computer representation of a test compound with the computer
model; and

(iii) analysing the fit of the compound in the mannosidase II or selected site.



In an aspect of the invention a method is provided comprising the following steps:

(a) docking a computer representation of a structure of a test compound into a
computer representation of a binding domain of a mannosidase II defined in
accordance with the invention using a computer program, or by interactively
moving the representation of the test compound into the representation of the
binding domain;

(b) characterizing the geometry and the complementary interactions formed between
the atoms of the binding domain and the compound; optionally

(c) searching libraries for molecular fragments which can fit into the empty space
between the compound and binding domain and can be linked to the compound;
and

(d) linking the fragments found in (c) to the compound and evaluating the new
modified compound.

In an embodiment of the invention a method is provided which comprises the following steps:

(a) docking a computer representation of a test compound from a computer data
base with a computer representation of a selected site (e.g. the inhibitor
binding domain) on a mannosidase II structure defined in accordance with the
invention to obtain a complex;

(b) determining a conformation of the complex with a favourable geometric fit and
favourable complementary interactions; and

(c) identifying test compounds that best fit the selected site as potential modulators
of the mannosidase II.

A method of the invention may be applied to a plurality of test compounds, to identify those
that best fit the selected site.
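
Applying the method to a plurality of test compounds amounts to scoring each docked compound and keeping the best. A minimal Python sketch of that loop is given below. The scoring function is supplied by the caller and is deliberately left open, since the text does not prescribe one; the names `screen_library`, `score_fn`, and `keep` are hypothetical, and a higher score is assumed to mean a better fit.

```python
from typing import Callable, Dict, List, Tuple, TypeVar

Pose = TypeVar("Pose")


def screen_library(
    library: Dict[str, Pose],
    score_fn: Callable[[Pose], float],
    keep: int = 3,
) -> List[Tuple[str, float]]:
    """Score every docked compound and return the `keep` best as (name, score)."""
    scored = [(name, score_fn(pose)) for name, pose in library.items()]
    # Higher score is assumed to mean a better fit to the selected site.
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:keep]
```

The compounds returned at the head of the list would be the candidates taken forward as potential modulators.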

The model used in the screening method may comprise the ligand-binding domain of a
mannosidase II enzyme either alone or in association with one or more ligands and/or
cofactors. For example, the model may comprise the ligand-binding domain in association
with a substrate or analogue thereof.

If the model comprises an unassociated ligand binding domain, then the selected site under
investigation may be the LBD itself. The test compound may, for example, mimic a known
substrate for the enzyme in order to interact with the LBD. The selected site may
alternatively be another site on the enzyme.

If the model comprises an associated LBD, for example an LBD in association with a 
substrate molecule or analogue thereof, the selected site may be the LBD or a site made up of 
the LBD and the complexed ligand, or a site on the ligand itself. The test compound may be 
investigated for its capacity to modulate the interaction with the associated molecule. 

A test compound (or plurality of test compounds) may be selected on the basis of its similarity 
to a known ligand for the mannosidase II. For example, the screening method may comprise 
the following steps: 

(i) generating a computer model of the LBD of a mannosidase II in complex with 
a ligand; 

(ii) searching for a test compound with a similar three dimensional structure and/or 
similar chemical groups; and 

(iii) evaluating the fit of the test compound in the LBD. 

Searching may be carried out using a database of computer representations of potential 
compounds, using methods known in the art. 
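
As one illustration of searching a database for compounds with similar chemical groups, each compound can be described by the set of chemical features it carries and compared with the known ligand using the Tanimoto coefficient (the size of the intersection of the two sets divided by the size of their union). This choice of similarity measure is an assumption made for the example and is not named in the text; the feature labels, the threshold, and the function names `tanimoto` and `similar_compounds` are likewise hypothetical.

```python
from typing import Dict, List, Set, Tuple


def tanimoto(a: Set[str], b: Set[str]) -> float:
    """Tanimoto coefficient of two feature sets: |a & b| / |a | b|."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def similar_compounds(
    query: Set[str],
    database: Dict[str, Set[str]],
    threshold: float = 0.5,
) -> List[Tuple[str, float]]:
    """Return (name, similarity) for entries at or above the threshold, best first."""
    hits = [(name, tanimoto(query, features)) for name, features in database.items()]
    hits = [hit for hit in hits if hit[1] >= threshold]
    hits.sort(key=lambda hit: hit[1], reverse=True)
    return hits
```

Compounds returned by such a search would then be docked and their fit in the LBD evaluated as in step (iii).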

The present invention also provides a method for designing ligands for a mannosidase II. It is
well known in the art to use a screening method as described above to identify a test
compound with promising fit, and then to use this test compound as a starting point to design a
ligand with improved fit to the model. A known modulator can also be modified to enhance
its fit with a model of the invention. Such techniques are known as "structure-based ligand
design" (See Kuntz et al., 1994, Acc. Chem. Res. 27:117; Guida, 1994, Current Opinion in
Struc. Biol. 4: 777; and Colman, 1994, Current Opinion in Struc. Biol. 4: 868, for reviews of
structure-based drug design and identification; and Kuntz et al 1982, J. Mol. Biol. 162:269;
Kuntz et al, 1994, Acc. Chem. Res. 27: 117; Meng et al., 1992, J. Comput. Chem. 13: 505;
Bohm, 1994, J. Comp. Aided Molec. Design 8: 623 for methods of structure-based modulator
design).

Examples of computer programs that may be used for structure-based ligand design are
CAVEAT (Bartlett et al., 1989, in "Chemical and Biological Problems in Molecular
Recognition", Roberts, S.M.; Ley, S.V.; Campbell, N.M. eds; Royal Society of Chemistry:
Cambridge, pp 182-196); FLOG (Miller et al., 1994, J. Comp. Aided Molec. Design 8:153);
PRO Modulator (Clark et al., 1995 J. Comp. Aided Molec. Design 9:13); MCSS (Miranker
and Karplus, 1991, Proteins: Structure, Function, and Genetics 8:195); and GRID (Goodford,
1985, J. Med. Chem. 28:849).



The method may comprise the following steps:

(i) docking a model of a test compound with a model of a selected site;

(ii) identifying one or more groups on the test compound which may be modified
to improve their fit in the selected site;

(iii) replacing one or more identified groups to produce a modified test compound
model; and

(iv) docking the modified test compound model with the model of the selected site.



Evaluation of fit may comprise the following steps:

(a) mapping chemical features of a test compound such as hydrogen bond donors or
acceptors, hydrophobic/lipophilic sites, positively ionizable sites, or negatively
ionizable sites; and

(b) adding geometric constraints to selected mapped features.
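
The two steps above can be sketched in Python by representing the mapped features as labelled points and the geometric constraints as allowed distance ranges between pairs of features. This is an illustration under stated assumptions: only distance constraints are considered, the feature labels and distance ranges are invented for the example, and the function name `satisfies_constraints` is hypothetical.

```python
import math
from typing import Dict, List, Tuple

Point = Tuple[float, float, float]
# (feature_a, feature_b, minimum distance, maximum distance) in angstroms
Constraint = Tuple[str, str, float, float]


def satisfies_constraints(features: Dict[str, Point],
                          constraints: List[Constraint]) -> bool:
    """Check that every constrained pair of mapped features lies within its range."""
    for label_a, label_b, low, high in constraints:
        distance = math.dist(features[label_a], features[label_b])
        if not low <= distance <= high:
            return False
    return True
```

A test compound whose mapped features satisfy all of the constraints derived from the selected site would be retained for closer evaluation.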



The fit of the modified test compound may then be evaluated using the same criteria.

The chemical modification of a group may either enhance or reduce hydrogen bonding
interaction, charge interaction, hydrophobic interaction, van der Waals interaction or dipole
interaction between the test compound and the key amino acid residue(s) of the selected site.
Preferably the group modifications involve the addition, removal, or replacement of
substituents on the test compound such that the substituents are positioned to collide or to
bind preferentially with one or more amino acid residues that correspond to the key amino
acid residues of the selected site.

Identified groups in a test compound may be substituted with, for example, alkyl, alkoxy,
hydroxyl, aryl, cycloalkyl, alkenyl, alkynyl, thiol, thioalkyl, thioaryl, amino, or halo groups.
Generally, initial substitutions are conservative, i.e., the replacement group will have
approximately the same size, shape, hydrophobicity and charge as the original group. It
should, of course, be understood that components known in the art to alter conformation
should be avoided.

If a modified test compound model has an improved fit, then it may bind to the selected site
and be considered to be a "ligand". Rational modification of groups may be made with the
aid of libraries of molecular fragments which may be screened for their capacity to fit into the
available space and to interact with the appropriate atoms. Databases of computer
representations of libraries of chemical groups are available commercially for this purpose.

A test compound may also be modified "in situ" (i.e. once docked into the potential binding
site), enabling immediate evaluation of the effect of replacing selected groups. The computer
representation of the test compound may be modified by deleting a chemical group or groups,
replacing chemical groups, or by adding a chemical group or groups. After each modification
to a compound, the atoms of the modified compound and potential binding site can be shifted
in conformation and the distance between the modulator and the active site atoms may be
scored on the basis of geometric fit and favourable complementary interactions between the
molecules. This technique is described in detail in Molecular Simulations User Manual, 1995
in LUDI.

Examples of ligand building and/or searching computer programs include programs in the
Molecular Simulations Package (Catalyst), ISIS/HOST, ISIS/BASE, and ISIS/DRAW (Molecular
Designs Limited), and UNITY (Tripos Associates).

The "starting point" for rational ligand design may be a known ligand for the enzyme. For
example, in order to identify potential modulators of the mannosidase II, a logical approach
would be to start with a known ligand (for example a substrate molecule or inhibitor) to
produce a molecule which mimics the binding of the ligand. Such a molecule may, for
example, act as a competitive inhibitor for the true ligand, or may bind so strongly that the
interaction (and inhibition) is effectively irreversible.

Such a method may comprise the following steps: 

(i) generating a computer model of a LBD of a mannosidase II in complex with a 
ligand;

(ii) replacing one or more groups on the ligand model to produce a modified 
ligand; and 

(iii) evaluating the fit of the modified ligand in the LBD. 

The replacement groups could be selected and replaced using a compound construction
program which replaces computer representations of chemical groups with groups from a
computer database, where the representations of the compounds are defined by structural 
coordinates. 
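
Steps (i) to (iii) above can be sketched in a few lines of Python. This is a minimal illustration only, not the method of any of the programs named in this document: the pocket and ligand coordinates are invented toy values (they are not taken from the tables of this specification), and the scoring rule, the distance cut-offs and the names `fit_score` and `replace_group` are assumptions made for the example.

```python
import math

# Hypothetical toy coordinates in angstroms; NOT taken from this specification's tables.
POCKET = [(0.0, 0.0, 0.0), (4.0, 0.0, 0.0), (0.0, 4.0, 0.0)]


def fit_score(ligand_atoms, pocket_atoms, clash=2.0, contact=4.5):
    """Crude geometric fit (assumed rule): +1 per ligand/pocket atom pair within
    contact range, -10 per pair closer than the clash distance."""
    score = 0
    for ligand_atom in ligand_atoms:
        for pocket_atom in pocket_atoms:
            distance = math.dist(ligand_atom, pocket_atom)
            if distance < clash:
                score -= 10
            elif distance <= contact:
                score += 1
    return score


def replace_group(ligand, group, new_atoms):
    """Step (ii): return a copy of the ligand model with one named group replaced."""
    modified = dict(ligand)
    modified[group] = list(new_atoms)
    return modified


def all_atoms(ligand):
    """Flatten a {group name: [atom coordinates]} model into a list of atoms."""
    return [atom for atoms in ligand.values() for atom in atoms]


# Step (i): a toy ligand model placed in the toy pocket.
ligand = {"core": [(2.0, 2.0, 2.5)], "R1": [(8.0, 8.0, 8.0)]}
# Step (ii): swap group R1 for a replacement group positioned nearer the pocket.
modified = replace_group(ligand, "R1", [(2.0, 2.0, -2.5)])
# Step (iii): compare the fit before and after the replacement.
original_score = fit_score(all_atoms(ligand), POCKET)
modified_score = fit_score(all_atoms(modified), POCKET)
print(original_score, modified_score)
```

In practice the replacement groups would come from a fragment database and the score would include chemical complementarity as well as geometry; the sketch only shows the replace-then-rescore loop.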

In an embodiment, a screening method is provided for identifying a ligand of a mannosidase
II comprising the step of using the structural coordinates of a substrate molecule or 
swainsonine or component thereof, defined in relation to its spatial association with a 
mannosidase II structure or a ligand binding domain of the invention, to generate a compound 
that is capable of associating with the mannosidase II or ligand binding domain. 

In an embodiment of the invention, a screening method is provided for identifying a ligand of
a mannosidase II comprising the step of using the structural coordinates of swainsonine listed
in Table 2 or 8 to generate a compound for associating with a ligand binding domain of a
mannosidase II as described herein. The following steps are employed in a particular method
of the invention: (a) generating a computer representation of swainsonine, defined by its
structural coordinates listed in Table 2 or 8; (b) searching for molecules in a database that are
structurally or chemically similar to the defined swainsonine, using a searching computer
program, or replacing portions of the compound with similar chemical structures from a
database using a compound building computer program.
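
The database search in step (b) can be illustrated with a simple similarity ranking. The sketch below is an assumption-laden example rather than the behaviour of any particular searching program: each molecule is reduced to a set of hypothetical feature labels, similarity is measured with the Tanimoto coefficient (a common choice, swapped in here for illustration), and the database entries, the feature names and the 0.5 threshold are all invented.

```python
def tanimoto(features_a, features_b):
    """Tanimoto coefficient: shared features divided by total distinct features."""
    a, b = set(features_a), set(features_b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def search_similar(query, database, threshold=0.5):
    """Return (name, similarity) pairs at or above the threshold, most similar first."""
    scored = [(name, tanimoto(query, features)) for name, features in database.items()]
    hits = [(name, score) for name, score in scored if score >= threshold]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


# Hypothetical feature labels standing in for a computer representation of the query.
QUERY = {"bicyclic_amine", "hydroxyl_1", "hydroxyl_2", "hydroxyl_8"}

# Hypothetical database entries; the names and features are invented for the example.
DATABASE = {
    "analogue_A": {"bicyclic_amine", "hydroxyl_1", "hydroxyl_2", "methoxy_8"},
    "analogue_B": {"bicyclic_amine", "hydroxyl_1", "hydroxyl_2", "hydroxyl_8", "methyl_5"},
    "unrelated_C": {"phenyl", "carboxyl"},
}

hits = search_similar(QUERY, DATABASE)
for name, score in hits:
    print(f"{name}: {score:.2f}")
```

A real search would derive the features from the structural coordinates (for example as a three-dimensional pharmacophore), but the ranking step has the same shape.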


The screening methods of the present invention may be used to identify compounds or entities 
that associate with a molecule that associates with a mannosidase II enzyme (for example, a 
substrate molecule). 

Compounds and entities (e.g. ligands) of mannosidase II identified using the above-described
methods may be prepared using methods described in standard reference sources utilized by 
those skilled in the art. For example, organic compounds may be prepared by organic 
synthetic methods described in references such as March, 1994, Advanced Organic 
Chemistry: Reactions, Mechanisms, and Structure, New York, McGraw Hill. 


Test compounds and ligands which are identified using a crystal or model of the present 
invention can be screened in assays such as those well known in the art. Screening can be, for 
example, in vitro, in cell culture, and/or in vivo. Biological screening assays preferably centre 
on activity-based response models, binding assays (which measure how well a compound
binds to the receptor), and bacterial, yeast and animal cell lines (which measure the biological
effect of a compound in a cell). The assays can be automated for high capacity-high
throughput screening (HTS) in which large numbers of compounds can be tested to identify
compounds with the desired activity. The biological assay may also be an assay for the
ligand binding activity of a compound that selectively binds to the LBD compared to other
nuclear receptors.

LIGANDS/COMPOUNDS/MODULATORS

The present invention provides a ligand or compound or entity identified by a screening 
method of the present invention. A ligand or compound may have been designed rationally
by using a model according to the present invention. A ligand or compound identified using
the screening methods of the invention specifically associates with a target compound. In the
present invention the target compound may be the mannosidase II enzyme or a molecule that
is capable of associating with the mannosidase II enzyme (for example a substrate molecule).
In a preferred embodiment the ligand is capable of binding to the LBD of a mannosidase II.

A ligand or compound identified using a screening method of the invention may act as a 
"modulator", i.e. a compound which affects the activity of a mannosidase II. A modulator 
may reduce, enhance or alter the biological function of a mannosidase II. For example a
modulator may modulate the capacity of the enzyme to hydrolyse mannose residues. An
alteration in biological function may be characterised by a change in specificity. For 
example, a modulator may cause the enzyme to accept a different substrate molecule, to 
transfer a different sugar, or to work with a different metal cofactor. In order to exert its 
function, the modulator commonly binds to the ligand binding domain. 


A "modulator" which is capable of reducing the biological function of the enzyme may also 
be known as an inhibitor. Preferably an inhibitor reduces or blocks the capacity of the 
enzyme to hydrolyse mannose residues. The inhibitor may mimic the binding of a substrate 
molecule, for example, it may be a substrate analogue. A substrate analogue may be designed
by considering the interactions between the substrate molecule and the enzyme (for example
by using information derivable from the crystal of the invention) and specifically altering one 
or more groups (as described above). 

In a highly preferred embodiment a modulator acts as an inhibitor of the mannosidase
II and is capable of inhibiting N-glycan biosynthesis. In another embodiment a modulator
enhances mannosidase II activity and is capable of regulating the immune system.

The present invention also provides a method for modulating the activity of a mannosidase II
within a cell using a modulator according to the present invention. It would be possible to
monitor the expression of N-glycans on the cell surface following such treatment by a number
of methods known in the art (for example by detecting expression with an N-glycan specific
antibody).

In another preferred embodiment, the modulator modulates the catalytic mechanism of the 
enzyme. 


A modulator may be an agonist, partial agonist, partial inverse agonist or antagonist of the 
mannosidase II. 

As used herein, the term "agonist" means any ligand, which is capable of binding to a ligand 
binding domain and which is capable of increasing a proportion of the enzyme that is in an
active form, resulting in an increased biological response. The term includes partial agonists 
and inverse agonists. 

As used herein, the term "partial agonist" means an agonist that is unable to evoke the 
maximal response of a biological system, even at a concentration sufficient to saturate the
specific receptors. 

As used herein, the term "partial inverse agonist" means an inverse agonist that evokes a
submaximal response in a biological system, even at a concentration sufficient to saturate the
specific receptors. At high concentrations, it will diminish the actions of a full inverse
agonist. 

The invention relates to a mannosidase II ligand binding domain antagonist, wherein said 
ligand binding domain is that defined by the amino acid structural coordinates described 
herein. For example the ligand may antagonise the inhibition of mannosidase by swainsonine.

As used herein, the term "antagonist" means any agent that reduces the action of another
agent, such as an agonist. The antagonist may act at the same site as the agonist (competitive
antagonism). The antagonistic action may result from a combination of the substance being
antagonised (chemical antagonism) or the production of an opposite effect through a different
receptor (functional antagonism or physiological antagonism) or as a consequence of 
competition for the binding site of an intermediate that links receptor activation to the effect 
observed (indirect antagonism). 

As used herein, the term "competitive antagonism" refers to the competition between an
agonist and an antagonist for a receptor that occurs when the binding of agonist and
antagonist becomes mutually exclusive. This may be because the agonist and antagonist
compete for the same binding site or combine with adjacent but overlapping sites. A third
possibility is that different sites are involved but that they influence the receptor
macromolecules in such a way that agonist and antagonist molecules cannot be bound at the
same time. If the agonist and antagonist form only short lived combinations with the receptor
so that equilibrium between agonist, antagonist and receptor is reached during the presence of
the agonist, the antagonism will be surmountable over a wide range of concentrations. In
contrast, some antagonists, when in close enough proximity to their binding site, may form a
stable covalent bond with it and the antagonism becomes insurmountable when no spare
receptors remain.
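
The surmountable case described above can be made concrete with the standard equilibrium expression for reversible competitive binding (the Gaddum equation), in which the fraction of sites occupied by the agonist is [A] / ([A] + K_A(1 + [B]/K_B)). The sketch below is illustrative only: the concentrations and dissociation constants are arbitrary values in the same arbitrary units, chosen for the example and not drawn from this specification.

```python
def agonist_occupancy(agonist, k_agonist, antagonist=0.0, k_antagonist=1.0):
    """Fraction of sites occupied by agonist at equilibrium when a reversible
    competitive antagonist is present (Gaddum equation).

    All concentrations and dissociation constants share the same arbitrary units.
    """
    return agonist / (agonist + k_agonist * (1.0 + antagonist / k_antagonist))


# Arbitrary illustrative values, not measurements.
no_antagonist = agonist_occupancy(agonist=1.0, k_agonist=1.0)
with_antagonist = agonist_occupancy(agonist=1.0, k_agonist=1.0, antagonist=9.0)
# Raising the agonist concentration ten-fold restores the original occupancy,
# which is what "surmountable" means in the paragraph above.
surmounted = agonist_occupancy(agonist=10.0, k_agonist=1.0, antagonist=9.0)
print(no_antagonist, with_antagonist, surmounted)
```

An antagonist that binds covalently does not obey this relation, because its binding is not at equilibrium; no increase in agonist concentration displaces it.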

As mentioned above, an identified ligand or compound may act as a ligand model (for
example, a template) for the development of other compounds. A modulator may be a
mimetic of a ligand or ligand binding domain. A mimetic of a ligand may compete with a
natural ligand for a mannosidase II and antagonize a physiological effect of the enzyme in an
animal. A mimetic of a ligand may be an organically synthesized compound. A mimetic of a
ligand binding domain may be either a peptide or other biopharmaceutical (such as an
organically synthesized compound) that specifically binds to a natural substrate molecule for
a mannosidase II and antagonizes a physiological effect of the enzyme in an animal.

A modulator may be one or a variety of different sorts of molecule. For example, a modulator
may be a peptide, member of random peptide libraries and combinatorial chemistry-derived
molecular libraries, phosphopeptide (including members of random or partially degenerate,
directed phosphopeptide libraries), a carbohydrate, a monosaccharide, an oligosaccharide or
polysaccharide, a glycolipid, a glycopeptide, a saponin, a heterocyclic compound, antibody,
carbohydrate, nucleoside or nucleotide or part thereof, and small organic or inorganic
molecule. A modulator may be an endogenous physiological compound, or it may be a natural
or synthetic compound. The modulators of the present invention may be natural or synthetic.
The term "modulator" also refers to a chemically modified ligand or compound, and includes
isomers and racemic forms.

Once a ligand has been optimally selected or designed, substitutions may then be made in 
some of its atoms or side groups in order to improve or modify its binding properties. 
Generally, initial substitutions are conservative, i.e., the replacement group will have
approximately the same size, shape, hydrophobicity and charge as the original group. It 
should, of course, be understood that components known in the art to alter conformation 
should be avoided. Such substituted chemical compounds may then be analyzed for efficiency 
of fit to the mannosidase II LBD by the same computer methods described above. 


Preferably, positions for substitution are selected based on the predicted binding orientation of 
a ligand to the mannosidase II LBD. 

A technique suitable for preparing a modulator will depend on its chemical nature. For 
example, organic compounds may be prepared by organic synthetic methods described in
references such as March, 1994, Advanced Organic Chemistry: Reactions, Mechanisms, and
Structure, New York, McGraw Hill. Peptides can be synthesized by solid phase techniques
(Roberge JY et al (1995) Science 269: 202-204) and automated synthesis may be achieved,
for example, using the ABI 431A Peptide Synthesizer (Perkin Elmer) in accordance with the
instructions provided by the manufacturer. Once cleaved from the resin, the peptide may be
purified by preparative high performance liquid chromatography (e.g., Creighton (1983)
Proteins Structures and Molecular Principles, WH Freeman and Co, New York NY). The 
composition of the synthetic peptides may be confirmed by amino acid analysis or sequencing 
(e.g., the Edman degradation procedure; Creighton, supra). 


If a modulator is a nucleotide, or a polypeptide expressable therefrom, it may be synthesized, 
in whole or in part, using chemical methods well known in the art (see Caruthers MH et al 
(1980) Nuc Acids Res Symp Ser 215-23, Horn T et al (1980) Nuc Acids Res Symp Ser 225- 
232), or it may be prepared using recombinant techniques well known in the art. 


Direct synthesis of a ligand or mimetics thereof can be performed using various solid-phase 
techniques (Roberge JY et al (1995) Science 269: 202-204) and automated synthesis may be 
achieved, for example, using the ABI 431A Peptide Synthesizer (Perkin Elmer) in
accordance with the instructions provided by the manufacturer. Additionally, the amino acid
sequences obtainable from the ligand, or any part thereof, may be altered during direct
synthesis and/or combined using chemical methods with a sequence from other subunits, or 
any part thereof, to produce a variant ligand. 

In an alternative embodiment of the invention, the coding sequence of a ligand or mimetics 
thereof may be synthesized, in whole or in part, using chemical methods well known in the art
(see Caruthers MH et al (1980) Nuc Acids Res Symp Ser 215-23, Horn T et al (1980) Nuc 
Acids Res Symp Ser 225-232). 

A wide variety of host cells can be employed for expression of the nucleotide sequences 
encoding a ligand of the present invention. These cells may be both prokaryotic and
eukaryotic host cells. Suitable host cells include bacteria such as E. coli, yeast, filamentous 
fungi, insect cells, mammalian cells, typically immortalized, e.g., mouse, CHO, human and 
monkey cell lines and derivatives thereof. Preferred host cells are able to process the 
expression products to produce an appropriate mature polypeptide. Processing includes but is
not limited to glycosylation, ubiquitination, disulfide bond formation and general
post-translational modification.

In an embodiment of the present invention, the ligand may be a derivative of, or a chemically 
modified ligand. The term "derivative" or "derivatised" as used herein includes the chemical
modification of a ligand. 

A chemical modification of a ligand and/or a key amino acid residue of a ligand binding 
domain of the present invention may either enhance or reduce hydrogen bonding interaction, 
charge interaction, hydrophobic interaction, van der Waals interaction or dipole interaction
between the ligand and the key amino acid residue(s) of the mannosidase II LBD. By way of
example, steric hindrance is a common means of changing the interaction of the
mannosidase II LBD binding domain with the activation domain. 

Preferably such modifications involve the addition of substituents onto a test compound such
that the substituents are positioned to collide or to bind preferentially with one or more amino 
acid residues that correspond to the key amino acid residues of mannosidase II LBD of the 
present invention. Typical modifications may include, for example, the replacement of a 
hydrogen by a halo group, an alkyl group, an acyl group or an amino group. 


The invention also relates to classes of modulators of mannosidase II based on the structure 
and shape of a substrate, defined in relation to the substrate molecule's spatial association
with a mannosidase II structure of the invention or part thereof. Therefore, a modulator may
comprise a substrate molecule having the shape or structure, preferably the structural
coordinates, of a substrate molecule in the active site binding pocket of a reaction catalyzed
by a mannosidase II. In an embodiment, the substrate comprises GlcNAcMan5GlcNAc2-Asn.

A modulator may be an inhibitor of a mannosidase II such as swainsonine or a derivative or
mimetic thereof.

A class of modulators of mannosidase II enzymes may comprise a compound containing a
structure of swainsonine, and having one or more, preferably all, of the structural coordinates
of swainsonine of Table 2 or 8. Functional groups in the swainsonine modulators may be
substituted with, for example, alkyl, alkoxy, hydroxyl, aryl, cycloalkyl, alkenyl, alkynyl, thiol,
thioalkyl, thioaryl, amino, or halo, or they may be modified using techniques known in the art.
Substituents will be selected to optimize the activity of the modulator.

PHARMACEUTICAL COMPOSITION 


The present invention also provides the use of a ligand or modulator according to the 
invention, in the manufacture of a medicament to treat and/or prevent a disease in a 
mammalian patient. There is also provided a pharmaceutical composition comprising such a 
ligand or modulator and a method of treating and/or preventing a disease comprising the step 
of administering such a modulator or pharmaceutical composition to a mammalian patient.

In an embodiment, the invention relates to a pharmaceutical composition which comprises a 
crystal structure of the invention or a part thereof (e.g. a binding domain), or a modulator of 
the invention in an amount effective to regulate one or more of the conditions described 
herein (e.g. tumor growth or metastasis) and a pharmaceutically acceptable carrier, diluent or
excipient. 

The pharmaceutical compositions may be for human or animal usage in human and veterinary 
medicine and will typically comprise a pharmaceutically acceptable carrier, diluent, excipient, 
adjuvant or combination thereof.

Acceptable carriers or diluents for therapeutic use are well known in the pharmaceutical art, 
and are described, for example, in Remington's Pharmaceutical Sciences, Mack Publishing 
Co. (A. R. Gennaro ed., 1985). The choice of pharmaceutical carrier, excipient or diluent
can be selected with regard to the intended route of administration and standard
pharmaceutical practice. The pharmaceutical compositions may comprise as - or in addition
to - the carrier, excipient or diluent any suitable binder(s), lubricant(s), suspending agent(s),
coating agent(s) or solubilising agent(s).

A pharmaceutical composition of the invention can be administered to a subject in an
appropriate carrier or diluent, co-administered with enzyme inhibitors or in an appropriate 
carrier such as microporous or solid beads or liposomes. Liposomes include water-in-oil-in- 
water emulsions as well as conventional liposomes (Strejan et al., (1984) J. Neuroimmunol 
7:27). 


Preservatives, stabilizers, dyes and even flavouring agents may be provided in the 
pharmaceutical composition. Examples of preservatives include sodium benzoate, sorbic acid 
and esters of p-hydroxybenzoic acid. Antioxidants and suspending agents may also be used. 

The routes for administration (delivery) include, but are not limited to, one or more of: oral
(e.g. as a tablet, capsule, or as an ingestible solution), topical, mucosal (e.g. as a nasal spray
or aerosol for inhalation), nasal, parenteral (e.g. by an injectable form), gastrointestinal,
intraspinal, intraperitoneal, intramuscular, intravenous, intrauterine, intraocular, intradermal,
intracranial, intratracheal, intravaginal, intracerebroventricular, intracerebral, subcutaneous,
ophthalmic (including intravitreal or intracameral), transdermal, rectal, buccal, vaginal,
epidural and sublingual.

Where the pharmaceutical composition is to be delivered mucosally through the 
gastrointestinal mucosa, it should be able to remain stable during transit through the 
gastrointestinal tract; for example, it should be resistant to proteolytic degradation, stable at
acid pH and resistant to the detergent effects of bile. 

It is to be understood that not all of the agent need be administered by the same route.

Where appropriate, the pharmaceutical compositions can be administered by inhalation, in the
form of a suppository or pessary, topically in the form of a lotion, gel, hydrogel, solution,
cream, ointment or dusting powder, by use of a skin patch, orally in the form of tablets
containing excipients such as starch or lactose or chalk, or in capsules or ovules either alone
or in admixture with excipients, or in the form of elixirs, solutions or suspensions containing
flavouring or colouring agents, or they can be injected parenterally, for example
intravenously, intramuscularly or subcutaneously. For parenteral administration, the
compositions may be best used in the form of a sterile aqueous solution which may contain
other substances, for example enough salts or monosaccharides to make the solution isotonic
with blood. The aqueous solutions should be suitably buffered (preferably to a pH of from 3
to 9), if necessary. The preparation of suitable parenteral formulations under sterile
conditions is readily accomplished by standard pharmaceutical techniques well-known to
those skilled in the art.

If the agent of the present invention is administered parenterally, then examples of such
administration include one or more of: intravenously, intra-arterially, intraperitoneally, 
intrathecally, intraventricularly, intraurethrally, intrasternally, intracranially, intramuscularly 
or subcutaneously administering the agent; and/or by using infusion techniques. 

For buccal or sublingual administration the compositions may be administered in the form of
tablets or lozenges which can be formulated in a conventional manner. 

The tablets may contain excipients such as microcrystalline cellulose, lactose, sodium citrate, 
calcium carbonate, dibasic calcium phosphate and glycine, disintegrants such as starch 
(preferably corn, potato or tapioca starch), sodium starch glycollate, croscarmellose sodium
and certain complex silicates, and granulation binders such as polyvinylpyrrolidone, 
hydroxypropylmethylcellulose (HPMC), hydroxypropylcellulose (HPC), sucrose, gelatin and 
acacia. Additionally, lubricating agents such as magnesium stearate, stearic acid, glyceryl 
behenate and talc may be included. 

Solid compositions of a similar type may also be employed as fillers in gelatin capsules.
Preferred excipients in this regard include lactose, starch, cellulose, milk sugar or high
molecular weight polyethylene glycols. For aqueous suspensions and/or elixirs, the agent
may be combined with various sweetening or flavouring agents, colouring matter or dyes,
with emulsifying and/or suspending agents and with diluents such as water, ethanol,
propylene glycol and glycerin, and combinations thereof.

As indicated, a therapeutic agent of the present invention can be administered intranasally or
by inhalation and is conveniently delivered in the form of a dry powder inhaler or an aerosol
spray presentation from a pressurised container, pump, spray or nebuliser with the use of a
suitable propellant, e.g. dichlorodifluoromethane, trichlorofluoromethane,
dichlorotetrafluoroethane, a hydrofluoroalkane such as 1,1,1,2-tetrafluoroethane (HFA
134A™) or 1,1,1,2,3,3,3-heptafluoropropane (HFA 227EA™), carbon dioxide or other
suitable gas. In the case of a pressurised aerosol, the dosage unit may be determined by
providing a valve to deliver a metered amount. The pressurised container, pump, spray or
nebuliser may contain a solution or suspension of the active compound, e.g. using a mixture
of ethanol and the propellant as the solvent, which may additionally contain a lubricant, e.g.
sorbitan trioleate. Capsules and cartridges (made, for example, from gelatin) for use in an
inhaler or insufflator may be formulated to contain a powder mix of the agent and a suitable
powder base such as lactose or starch.

Therapeutic administration of polypeptide modulators may also be accomplished using gene 
therapy. A nucleic acid including a promoter operatively linked to a heterologous polypeptide 
may be used to produce high-level expression of the polypeptide in cells transfected with the 
nucleic acid. DNA or isolated nucleic acids may be introduced into cells of a subject by
conventional nucleic acid delivery systems. Suitable delivery systems include liposomes,
naked DNA, and receptor-mediated delivery systems, and viral vectors such as retroviruses,
herpes viruses, and adenoviruses. 






APPLICATIONS

The modulators and compositions of the invention may be used to modulate the biological
activity of a mannosidase II in a cell, including modulating a pathway in a cell regulated by
the mannosidase II or modulating a mannosidase II with inappropriate activity in a cellular
organism. In addition, a mannosidase II structure of the invention may be used to devise
protocols to modulate the biological activity of a mannosidase II in a cell.

Cellular assays, as well as animal model assays in vivo, may be used to test the activity of a 
potential modulator of a mannosidase II as well as to diagnose a disease associated with
inappropriate mannosidase II activity. In vivo assays are also useful for testing the bioactivity 
of a potential modulator designed by the methods of the invention. 

The invention further provides a method of treating a mammal, the method comprising 
administering to a mammal a modulator or pharmaceutical composition of the present
invention. 

Typically, a physician will determine the actual dosage which will be most suitable for an 
individual subject and it will vary with the age, weight and response of the particular patient 
and severity of the condition. The dosages below are exemplary of the average case. There
can, of course, be individual instances where higher or lower dosage ranges are merited. 

The specific dose level and frequency of dosage for any particular patient may be varied and
will depend upon a variety of factors including the activity of the specific compound
employed, the metabolic stability and length of action of that compound, the age, body
weight, general health, sex, diet, mode and time of administration, rate of excretion, drug
combination, the severity of the particular condition, and the individual undergoing therapy.
By way of example, the pharmaceutical composition of the present invention may be
administered in accordance with a regimen of 1 to 10 times per day, such as once or twice per
day.

For oral and parenteral administration to human patients, the daily dosage level of the agent
may be in single or divided doses.

The modulators (e.g. inhibitors) identified using the methods of the invention may be useful
in the treatment and prophylaxis of tumor growth and metastasis of tumors. Anti-metastatic 
effects of inhibitors can be demonstrated using a lung colonization assay. For example, 
melanoma cells treated with an inhibitor may be injected into mice and the ability of the 
melanoma cells to colonize the lungs of the mice may be examined by counting tumor 
nodules on the lungs after death. Suppression of tumor growth in mice by the inhibitor
administered orally or intravenously may be examined by measuring tumor volume. 

An inhibitor identified using the invention may have particular application in the prevention 
of tumor recurrence after surgery i.e. as an adjuvant therapy. 


An inhibitor may be especially useful in the treatment of various forms of neoplasia such as
leukemias, lymphomas, melanomas, adenomas, sarcomas, and carcinomas of solid tissues in
patients. In particular, inhibitors can be used for treating malignant melanoma, pancreatic
cancer, cervico-uterine cancer, ovarian cancer, cancer of the kidney such as metastatic renal
cell carcinoma, stomach, lung, rectum, breast, bowel, gastric, liver, thyroid, head and neck
cancers such as unresectable head and neck cancers, lymphangitis carcinomatosis, cancers of
the cervix, breast, salivary gland, leg, tongue, lip, bile duct, pelvis, mediastinum, urethra,
bronchogenic, bladder, esophagus and colon, non-small cell lung cancer, and Kaposi's
sarcoma, which is a form of cancer associated with HIV-infected patients with Acquired
Immune Deficiency Syndrome (AIDS). The inhibitors may also be used for other
anti-proliferative conditions such as bacterial and viral infections, in particular AIDS.

An inhibitor identified in accordance with the present invention may be used to treat 
immunocompromised subjects. For example, they may be used in a subject infected with 
HIV, or other viruses or infectious agents including bacteria, fungi, and parasites, in a subject
undergoing bone marrow transplants, and in subjects with chemical or tumor-induced immune
suppression. 

Inhibitors may be used as hemorestorative agents and in particular to stimulate bone marrow 
cell proliferation, in particular following chemotherapy or radiotherapy. The
myeloproliferative activity of an inhibitor of the invention may be determined by injecting the 
inhibitor into mice, sacrificing the mice, removing bone marrow cells and measuring the 
ability of the inhibitor to stimulate bone marrow proliferation by directly counting bone 
marrow cells and by measuring clonogenic progenitor cells in methylcellulose assays. The 
inhibitors can also be used as chemoprotectants, and in particular to protect mucosal
epithelium following chemotherapy. 

An inhibitor identified in accordance with the invention also may be used as an antiviral agent 
in particular on membrane enveloped viruses such as retroviruses, influenza viruses, 
cytomegaloviruses and herpes viruses. An inhibitor may also be used to treat bacterial, fungal,
and parasitic infections. An inhibitor may also be used in the treatment of inflammatory 
diseases such as rheumatoid arthritis, asthma, inflammatory bowel disease, and 
atherosclerosis. 

An inhibitor may also be used to augment the anti-cancer effects of agents such as
interleukin-2 and poly-IC, to augment natural killer and macrophage tumoricidal activity,
induce cytokine synthesis and secretion, enhance expression of LAK and HLA class I specific
antigens, activate protein kinase C, stimulate bone marrow cell proliferation including
hematopoietic progenitor cell proliferation, and increase engraftment efficiency and
colony-forming unit activity, to confer protection against chemotherapy and radiation therapy
(e.g. chemoprotective and radioprotective agents), and to accelerate recovery of bone marrow
cellularity particularly when used in combination with chemical agents commonly used in the
treatment of human diseases including cancer and acquired immune deficiency syndrome
(AIDS). For example, an inhibitor can be used as a chemoprotectant in combination with
anti-cancer agents including doxorubicin, 5-fluorouracil, cyclophosphamide, and methotrexate,
and in combination with isoniazid or NSAID.

Alpha-mannosidosis may also be amenable to treatment or prophylaxis by the
method of the present invention.

The loss of mannosidase II has been found to alter N-glycan branching and attenuate
the immune system's ability to maintain self-tolerance (Chui et al., PNAS 98(3):1142-1147,
2001). Therefore, the structures, modulators, compositions, and methods of the invention may
be useful in the treatment or prophylaxis of autoimmune disease including systemic lupus
erythematosus.

The present invention thus provides a method for treating the above-mentioned
conditions in a subject comprising administering to the subject an effective amount of a
modulator of the invention. The invention also contemplates a method for stimulating or
inhibiting tumor growth or metastasis in a subject comprising administering to the subject an
effective amount of a modulator of the invention.

The following non-limiting examples are illustrative of the present invention.

EXAMPLES 
Example 1 

Drosophila Mannosidase II preparation and structure determination 
Expression Plasmids 

Constructs designed to express dGMII in Drosophila Schneider (S2) cells were based on
the DES expression system available from Invitrogen, with extensive modifications.
Expression plasmids were constructed which had the dGMII under the control of either the
inducible metallothionein (MT) promoter or the strong constitutive actin 5.1 promoter
(AC5). Amino-terminal purification tags were inserted in place of the C-terminal tags in the
commercially available vectors. Earlier attempts to truncate the mouse enzyme at the
C-terminus resulted in inactive protein, as had also been noted with the GlcNAc-transferases.
Thus, it was elected to keep the C-terminus free. Expression vectors were created with either a
6His-tag for purification on metal chelate columns such as Ni-NTA (Qiagen) or cobalt-based
Talon columns (Clontech), or with a Strep-tag for purification on streptavidin-Sepharose.
These affinity tags are initially non-cleavable and add approximately S-10 residues to the end
of the protein. Finally, constructs were made either lacking or containing the BiP secretion
sequence to direct the expressed protein into the cells or medium, respectively.



Blasticidin Selection 

Initial attempts at stable transfection with the recommended hygromycin selection system
were unsuccessful. Therefore a new selection plasmid, pCopBlast, was created, which encodes
blasticidin S deaminase under the control of the constitutive copia promoter. Blasticidin S
has been used to select stable transfectants of mammalian and plant cells, as well as yeast.
Commercially available control plasmids expressing MT-induced secreted green fluorescent
protein (GFP), or constitutive and MT-induced unsecreted bacterial β-galactosidase (LacZ),
were used to test the suitability of blasticidin selection in S2 cells, and to optimize conditions
for transfection, selection, and metallothionein induction. Stable transfectants could be
selected with 16 µg/ml blasticidin in Schneider's S2 medium containing 10% fetal bovine
serum. Copper and cadmium were the only metals found to activate the MT promoter; copper
favoured internally expressed proteins and cadmium favoured secreted proteins. Maintenance
of the altered phenotype was also demonstrated for many weeks in the absence of the selective
pressure of blasticidin, demonstrating that these were indeed stably transfected cell lines.

Creation of Stably Expressing dGMII cell lines

Starting with the pProtA expression plasmid from initial published studies [Rabouille et al.,
1999], the mannosidase coding region was excised and inserted into an in-frame EcoRI site
immediately at the end of the affinity tag in the new plasmids. The position of a unique 3'
restriction site outside the coding region meant that 100-200 bp of extra sequence was added
between the stop codon and the SV40 polyadenylation site. This extra sequence was
removed with a short PCR amplification using a unique internal restriction site. Both ends of
the constructs were sequenced to verify proper reading frame and lack of PCR errors. The
resulting constructs consist of the dGMII catalytic region with a short length of the stalk
region, in a variety of "flavours" of promoter, affinity tag, and expression location.

Co-transfection of the pCopBlast selection plasmid with the mannosidase expression
plasmids, followed by selection for blasticidin resistance, yielded stably expressing cell lines
after approximately one month. Mannosidase activity was measured using PNP-mannoside
in a microtitre plate assay. Protein was detected on Western blots using anti-PentaHis
antibody (Qiagen). Only the secreted products showed activity, with similar levels in the
constitutive and MT-promoter constructs. No difference in mannosidase activity was seen
between His- and Strep-tagged protein. All subsequent work was carried out with the secreted
constructs.

Insect cells do not grow at low population densities. Therefore, the initial population of
selected cells was a mixed population, with each cell in the culture having somewhat different
levels of incorporated expression plasmid. To select individual cells with high levels of
expression, the stably transfected population was diluted to single cells in a 50:50 mix of
conditioned medium and fresh medium with blasticidin. These were then plated in 96-well
culture plates. After five weeks, about 10% of the wells showed growth of colonies large
enough to transfer, of which roughly 30% had activity. The highest expressors had
approximately 5 times the activity of the initial population in the MT-inducible strains. No
high-expressing clones of the constitutively expressed dGMII were obtained, suggesting that
the continued production of mannosidase by the cells may be detrimental, especially under the
stressful conditions of single-cell selection.

Expression and purification of dGMII

The availability of stable clones expressing considerable amounts of mannosidase allowed
optimization of induction, expression and purification conditions. In contrast to mammalian
cells, insect cells are not highly adherent and will grow to high cell densities in a variety of
culture vessels including roller bottles, spinners, fermentors and shake flasks. No CO2 is
required, and temperatures in the range of 25-28°C are optimal. With stably transfected cells,
the difficulties that accompany baculoviral infection do not arise.

Initial experiments were carried out in S2 medium containing 10% bovine serum. The metal
concentrations used for induction and the time of induction were optimized for dGMII
production. 10-20 µM cadmium proved optimal for induction. Although copper (at
approximately 500-1000 µM) is generally used in the literature for induction, the sensitivity
of dGMII to inhibition by copper (IC50 = 25 µM, [26]) precluded its use. Cadmium has been
reported to be detrimental to the growth of cells. However, at the concentrations used here,
the cells continued to grow and maintained greater than 90% viability (as assessed by Trypan
blue exclusion) until the end of the induction period. Cells were maintained in the continuous
presence of cadmium for up to three passages.
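
The incompatibility of copper induction with dGMII activity can be illustrated with a rough estimate. The following is a minimal sketch only: it assumes a simple one-site inhibition model, in which the residual activity is 1/(1 + [I]/IC50), and the function name is hypothetical. Neither the model nor the code is taken from the measurement cited above.

```python
def fractional_activity(inhibitor_um: float, ic50_um: float) -> float:
    """Residual enzyme activity under an ASSUMED one-site inhibition model."""
    if inhibitor_um < 0 or ic50_um <= 0:
        raise ValueError("concentration must be >= 0 and IC50 must be > 0")
    return 1.0 / (1.0 + inhibitor_um / ic50_um)


COPPER_IC50_UM = 25.0  # IC50 of copper for dGMII, as quoted in the text

if __name__ == "__main__":
    # Copper concentrations in the range quoted for literature induction protocols
    for copper_um in (500.0, 1000.0):
        remaining = fractional_activity(copper_um, COPPER_IC50_UM)
        print(f"{copper_um:6.0f} uM Cu -> {remaining:.1%} activity remaining")
```

Under this assumed model, copper at the concentrations normally used for induction would leave only a few percent of the enzyme activity, which is consistent with the choice of cadmium.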


As the dGMII was secreted into the medium, it was heavily contaminated with bovine serum
albumin (BSA). Attempts to remove the impurity by Blue-Agarose or Ni-NTA
chromatography were unsuccessful. To circumvent this contamination problem, a number of
serum-free media were evaluated for growth and expression levels. Very few
serum-free media have been developed for Drosophila cells, so media that have been used with
baculovirus expression systems were evaluated. Ultimately the Ex-Cell 420 medium from JRH
Biosciences was successful.

A further advantage of this medium is that it permits the incorporation of seleno-methionine
in place of methionine for crystallographic phasing purposes. A custom preparation of this
medium, free of Met and Cu, was purchased from JRH. Inclusion of 50 µg/ml of SeMet resulted
in the production of protein with high enough incorporation (approximately 50% by mass
spectrometry) for accurate phasing.

Cells were adapted to serum-free growth by gradual dilution with CCM3 medium and were then
switched into the other media for the expression studies. Ex-Cell 420, CCM3 and
SFX-Insect were clearly superior for maintaining healthy growth, though CCM3 provided
slightly lower levels of expression. Levels of cadmium required for induction were optimized
for each medium and were considerably lower than those required in S2 medium. For
unknown reasons, constitutive expression of dGMII was much lower in serum-free medium.
Therefore, all subsequent scale-up and purifications were carried out with the MT-inducible
6His-tagged constructs.



To scale up protein expression, cells were first grown as suspensions in spinner cultures.
These were subsequently transferred into 2.8 litre Fernbach flasks (1 litre Ex-Cell 420/flask)
shaken at 100 rpm at 28°C. Cells were induced for 72 hours with 10 µM cadmium. After this
time the medium was aseptically harvested and the cells were placed in the same volume of
fresh medium for a further round of induction. This can be repeated at least one more time
without significant cell death or loss of protein expression. Based on activity measurements,
up to 50 mg/litre of medium can be expressed every three days. This is approximately
1000-fold greater than in initial expression experiments in CHOP cells [Rabouille et al.,
1999]. This procedure requires about 2 weeks of dedicated time in an incubator/shaker.

Purification is effected by batch binding first to Blue-Agarose, with elution by 350 mM NaCl,
and then to Ni-NTA resin, with elution by 50 mM imidazole. Initially, secreted protein from the
medium of the serum-free grown cells was loaded in batch onto Blue-Agarose. The beads were
then loaded into a column and washed with 20 column volumes of 50 mM NaCl in 20 mM
Tris pH 8. The majority of the mannosidase was eluted with 350 mM NaCl. This pooled eluate
was loaded onto Ni-NTA, washed with low imidazole, and eluted with 50 mM imidazole to
achieve crystallization purity. The protein was then dialysed extensively against 10 mM Tris,
pH 8.3 and 100 mM NaCl and concentrated (to greater than 20 mg/ml) for crystallization
trials. All crystallization has been carried out from a single protein preparation.

Crystallization 

Crystals of Drosophila Mannosidase II and complexes of the enzyme with various inhibitors
were grown at room temperature using vapor diffusion and micro-batch crystallization
techniques. Crystals were obtained under a wide variety of conditions. Polyethylene glycol
(PEG) was used as a precipitant (with sizes: 4000; 6000; 8000; 10000; and 20000) at
concentrations varying from 5-20%, in the presence of 5% 2-methyl-2,4-pentanediol (MPD) or
0-30% glycerol. Crystallization solutions were buffered at pH 7-7.5 using 100 mM buffer
solutions of Tris, Hepes or Mes. The crystals belong to the orthorhombic space group P2₁2₁2₁
with cell dimensions: a=69 Å; b=110 Å; c=139 Å; α=90°; β=90°; γ=90°. For the initial structure
determination, seleno-methionine-derivatized Mannosidase II crystals were grown in 8.5%
PEG 6000, 5% MPD and 100 mM Tris pH 7.0, using micro seeds obtained from wild-type
enzyme crystals. Data were collected from crystals that were frozen in liquid nitrogen after a
stepwise increase of the MPD concentration in the crystallization solution from 5% to 25%.
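
As a plausibility check on the cell dimensions quoted above (a = 69 Å, b = 110 Å, c = 139 Å, all angles 90°), the unit cell volume and an approximate Matthews coefficient can be computed. This is a sketch under stated assumptions: the molecular weight of about 115 kDa and the presence of one molecule per asymmetric unit are illustrative guesses that are not given in the text, and the function names are hypothetical. The four asymmetric units per cell follow from the orthorhombic space group with three twofold screw axes.

```python
def orthorhombic_cell_volume(a: float, b: float, c: float) -> float:
    """Volume in cubic angstroms of a cell whose angles are all 90 degrees."""
    return a * b * c


def matthews_coefficient(volume_a3: float, mol_weight_da: float,
                         asu_per_cell: int, molecules_per_asu: int = 1) -> float:
    """Crystal volume per unit of protein mass, in cubic angstroms per dalton."""
    return volume_a3 / (asu_per_cell * molecules_per_asu * mol_weight_da)


def solvent_fraction(vm: float) -> float:
    """Common approximation: 1 - 1.23 / Vm."""
    return 1.0 - 1.23 / vm


if __name__ == "__main__":
    volume = orthorhombic_cell_volume(69.0, 110.0, 139.0)
    ASSUMED_MW_DA = 115_000.0  # hypothetical estimate, not stated in the text
    ASU_PER_CELL = 4           # four general positions in this space group
    vm = matthews_coefficient(volume, ASSUMED_MW_DA, ASU_PER_CELL)
    print(f"cell volume      : {volume:.0f} A^3")
    print(f"Matthews coeff.  : {vm:.2f} A^3/Da")
    print(f"solvent fraction : {solvent_fraction(vm):.0%}")
```

With these assumed inputs the Matthews coefficient falls within the range usually seen for protein crystals, which is consistent with a single molecule in the asymmetric unit.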

A crystal of the invention is illustrated in the Figures. In particular, Figure 1 shows the active
site of a mannosidase II. Figure 2 shows the secondary structure of Drosophila Golgi
α-mannosidase II. Helices are in blue and β sheets are in red. Figure 3 shows the Drosophila
Golgi α-mannosidase II molecule with the colours representing where it is identical to human
GMII. The red and blue represent deletions or insertions with respect to the human sequence.
The green is a disulphide bond. Figure 4 shows the whole Drosophila Golgi α-mannosidase II
molecule in sticks with residues that are identical in the lysosomal manII as coloured balls
(red or blue depending whether they are in the N-terminal or C-terminal part of the molecule).
Figure 5 shows the active site of a Drosophila mannosidase. Figure 6 shows the DNA
sequence of an expressed Drosophila mannosidase. Figure 7 shows an alignment of expressed
secreted Drosophila mannosidase with human mannosidase.

Example 2

Experimental Procedures 

Protein Overexpression and Purification 

Expression, purification and crystallization of the dGMII will be described in detail 
elsewhere. Briefly, the cDNA was inserted behind an inducible promoter, and used to stably 
transfect Drosophila S2 cells. Single cell clones secreting high levels of dGMII were chosen
and adapted to serum-free medium. Unlabelled dGMII was isolated from the supernatants of 
cells grown in Fernbach flasks by batch binding to Blue-Agarose (Sigma). The protein was 
eluted from the Blue-Agarose using NaCl and further purified by Ni-NTA chromatography 
(Qiagen). EDTA (5 mM) was added to scavenge any free nickel. The protein was extensively
dialyzed against 10 mM Tris pH 8 containing 100 mM NaCl, concentrated to 25 mg/ml, and
stored in aliquots at -80 °C. 



For seleno-methionine labeling, a custom batch of Ex-Cell 420 (#006140E JRH Biosciences,
Lenexa KS) was used which lacked any added methionine or copper. Cells were grown to
high cell density in a spinner flask in standard medium, resuspended in the "methionine-free"
medium and allowed to starve for 4 hours prior to the addition of 50 mg/l of seleno-methionine
(Sigma). After 70 hrs of induction the protein was purified from the supernatant as
outlined above, except that 5 mM β-mercaptoethanol was present throughout the purification.


Crystallization and Data Collection 

Crystals of Drosophila Mannosidase II and complexes of the enzyme with various inhibitors
were grown at room temperature using vapor diffusion and micro-batch crystallization
techniques. Crystals were obtained under a wide variety of conditions. Polyethylene glycol
(PEG) was used as a precipitant (with sizes: 4000; 6000; 8000; 10000; and 20000) at
concentrations varying from 5-20%, in the presence of 5% 2-methyl-2,4-pentanediol (MPD) or
0-30% glycerol. Crystallization solutions were buffered at pH 7-7.5 using 100 mM buffer
solutions of Tris, Hepes or Mes. The crystals belong to the orthorhombic space group P2₁2₁2₁
with cell dimensions: a=69 Å; b=110 Å; c=139 Å; α=90°; β=90°; γ=90°. For the initial structure
determination, seleno-methionine-derivatized Mannosidase II crystals were grown in 8.5%
PEG 6000, 5% MPD and 100 mM Tris pH 7.0, using micro seeds obtained from wild-type
enzyme crystals. Data were collected from crystals that were frozen in liquid nitrogen after a
stepwise increase of the MPD or glycerol concentration in the crystallization solution from
5% to 25%. Data collection was performed at the Advanced Photon Source facility at
Argonne National Laboratory, Argonne, Illinois. Beam line BM14D was used for collection
of multiple wavelength anomalous dispersion data and BM14C for collection of
high-resolution data.

Structure Determination

The structure of uncomplexed dGMII was determined by MAD phasing at the selenium
absorption edge with datasets collected at an absorption peak wavelength of 0.9786 Å, an
inflection wavelength of 0.9790 Å and a remote wavelength of 0.9770 Å. Initial positions of
26 out of 28 selenium atoms were determined with the program Solve (Terwilliger et al.,
1987) with an initial Figure of Merit (FOM) of 0.67. The experimental map obtained after
density modification, using the program DM of the CCP4 program package (Cowtan, 1994),
showed continuous density of very high quality for the whole molecule. The structure was
traced using the program O (Jones et al., 1991) using the density-modified experimental map.
The model was refined using the program CNS (Brünger et al., 1998).
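
The three MAD wavelengths quoted above can be converted into photon energies to see how closely they bracket the selenium absorption edge. The sketch below uses the standard relation E = hc/λ, with hc taken as 12.398419843 keV·Å; the function name and the dictionary of data sets are illustrative and do not correspond to any beam line software.

```python
HC_KEV_ANGSTROM = 12.398419843  # Planck constant times speed of light, in keV * angstrom


def wavelength_to_kev(wavelength_angstrom: float) -> float:
    """Convert an X-ray wavelength in angstroms to a photon energy in keV."""
    if wavelength_angstrom <= 0:
        raise ValueError("wavelength must be positive")
    return HC_KEV_ANGSTROM / wavelength_angstrom


# Wavelengths as quoted in the text for the three MAD data sets
MAD_WAVELENGTHS = {"peak": 0.9786, "inflection": 0.9790, "remote": 0.9770}

if __name__ == "__main__":
    for name, wavelength in MAD_WAVELENGTHS.items():
        energy = wavelength_to_kev(wavelength)
        print(f"{name:<10} {wavelength:.4f} A  ->  {energy:.4f} keV")
```

The inflection point lies at the lowest energy, the peak slightly above it and the remote data set at the highest energy of the three, as expected for a high-energy remote wavelength.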


Inductively Coupled Plasma Atomic Emission Spectroscopy (ICP-AES) 

The metal content in dGMII samples was analyzed by inductively coupled plasma atomic
emission spectroscopy using the ICP-AES model 'Optima 3000 DV' (Dual View) from
Perkin Elmer. The zinc content in the protein samples was determined relative to an
equivalent amount of dGMII assay buffer.

RESULTS AND DISCUSSION 

Protein expression 

The cDNA for Drosophila GMII is predicted to encode a protein of 1108 amino acids. For
protein expression in Drosophila cells the first 75 amino acids consisting of the cytosolic and 
transmembrane domains and most of the stalk region were eliminated. The remaining cDNA 
was cloned in-frame behind a secretion signal. 

Numbering of our construct starts at the point where the expressed protein is expected to be
cleaved, by signal peptidase, from the secretion signal. Three extra amino-terminal residues, a
6-histidine tag, and a glycine, glutamine and phenylalanine were added in cloning. The first
aspartate (D13) of the construct corresponds to aspartate 76 of the native protein. The first
residue seen in the structure (C31) corresponds to C94, and the final residue S1044 to S1107,
of the full-length sequence.
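
The correspondence between construct numbering and native numbering given above amounts to a constant offset of 63 residues (76 - 13). A minimal sketch of the conversion follows; the function and constant names are hypothetical and are introduced only for illustration.

```python
# Offset implied by the text: construct D13 corresponds to native D76
CONSTRUCT_TO_NATIVE_OFFSET = 76 - 13


def construct_to_native(residue_number: int) -> int:
    """Map a residue number in the expressed construct to the full-length sequence."""
    return residue_number + CONSTRUCT_TO_NATIVE_OFFSET


def native_to_construct(residue_number: int) -> int:
    """Map a residue number in the full-length sequence to the expressed construct."""
    return residue_number - CONSTRUCT_TO_NATIVE_OFFSET


if __name__ == "__main__":
    # Residue pairs quoted in the text: D13/D76, C31/C94, S1044/S1107
    for construct_number in (13, 31, 1044):
        print(construct_number, "->", construct_to_native(construct_number))
```

All three residue pairs quoted in the text are reproduced by the same offset, so residue numbers used elsewhere in this example can be converted to native numbering by adding 63.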

Structure Determinations

The structure of Drosophila Golgi α-mannosidase II has been determined by the
multi-wavelength anomalous dispersion (MAD) phasing method using a data set collected from a
crystal of seleno-methionine derivatized enzyme (Table 9). This is the first reported structure
of a Se-Met substituted enzyme produced in a Drosophila overexpression system. The native
dGMII structure has been refined to a resolution of 1.76 Å with some data to 1.4 Å resolution
(see refinement statistics presented in Table 10). The model contains residues 31-1044 of the
recombinant enzyme (numbered as described above), as well as a zinc ion, an N-glycan
residue, a molecule of the cryo-protectant 2-methyl-2,4-pentanediol (MPD), and a
tris(hydroxymethyl)-aminomethane (Tris) molecule. The presence of the enzyme-bound zinc
ion was confirmed by Inductively Coupled Plasma Atomic Emission Spectroscopy (ICP-AES).
The final structure of the dGMII-swainsonine complex has been refined at 1.87 Å
resolution and the dGMII-DMNJ complex at 1.69 Å resolution, with some data to 1.5 Å
resolution.

Overall Architecture of dGMII 

The structure of dGMII reveals a previously unobserved protein fold consisting of an
N-terminal α/β domain, a three-helical bundle and an all-β C-terminal domain forming a single
compact entity, connected by 5 internal disulfide bonds and stabilized by a zinc binding site
(Figure 8B). The oval-shaped molecule has two distinct faces (Figure 8C). The N-terminal
face of the molecule is convex, whereas the opposing face of the enzyme has a planar surface.
N-terminal residue Cys-31 is the last residue of the so-called stalk region, the linkage between
the catalytic domain and the transmembrane domain. Cys-31 is located at the convex face of
the molecule, indicating that this surface of the molecule presumably faces the inner side of
the Golgi membrane, while the planar surface, containing the active site cavity (see below),
faces the Golgi lumen.

The N-terminal α/β domain is comprised of an inner core of three β-sheets (A, B and C,
Figure 8B) consisting of 11 mostly parallel β-strands, surrounded by 16 α-helices. This
domain contains a GlcNAc residue found in the electron density map at a consensus
N-glycosylation site (Asn-194), located at the N-terminus of helix 7. The α/β domain is
stabilized by three disulfide bonds: between Cys-31 and Cys-1032, connecting the N- and
C-terminal extremes of dGMII; Cys-275 and Cys-282, linking helices 10 and 11; and Cys-283
and Cys-297, linking helix 11 with a loop between helix 13 and the core of parallel β-sheets.
The cysteines forming the latter two disulfide bonds are conserved in the human Golgi
α-mannosidase II sequence.

The C-terminal half of the protein contains a three-helix bundle, comprised of helices 18, 20
and 21, and is connected to the N-terminal α/β-domain via a zinc binding site. The zinc ion is
coordinated in a T5 square-based pyramidal geometry involving residues His-90, Asp-92,
Asp-204 and His-471. Furthermore, the C-terminal domain contains two immunoglobulin-like
domains: a small β-sandwich consisting of 12 anti-parallel strands from β-sheets D and E, and
a large 21-strand structure involving β-sheets F and G.


A barrel formed by the three-helix bundle and helix 23, together with the two β-sandwich
structures, results in a narrow pore in the center of the C-terminal domain. The pore is lined by
six arginine residues (Arg-540, 565, 617, 770, 777 and 893), contributing to the overall
positive charge of the pore (Figure 9A). A hairpin loop connecting two strands of β-sheet D
(Figure 8B and C, residues 527-540, shown in yellow) protrudes into the center of the barrel
on the planar side of the molecule. Arginine residue 530, located at the tip of the type-I β-turn
in this loop, plugs the pore, preventing an open channel through the protein. The resulting
crater-like cavity on the convex side of the molecule is 20 Å deep, with a diameter of 20 Å
funneling to 8 Å at the bottom of the cavity. B-factor values of residues within the loop
indicate a higher degree of flexibility compared to the rest of the structure (average B-factor
values: ~33 Å² and ~15 Å², respectively).
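
To give the B-factor comparison above a physical scale, the average values of about 33 Å² (loop) and about 15 Å² (rest of the structure) can be converted into root-mean-square atomic displacements. The sketch assumes the usual isotropic relation B = 8π²⟨u²⟩, where ⟨u²⟩ is the mean-square displacement along one direction; the function name is hypothetical.

```python
import math


def rms_displacement(b_factor_a2: float) -> float:
    """RMS displacement in angstroms, assuming the isotropic relation B = 8 * pi^2 * <u^2>."""
    if b_factor_a2 < 0:
        raise ValueError("B-factor must be non-negative")
    return math.sqrt(b_factor_a2 / (8.0 * math.pi ** 2))


if __name__ == "__main__":
    # Approximate average B-factors quoted in the text
    for label, b_value in (("hairpin loop", 33.0), ("rest of structure", 15.0)):
        print(f"{label:<18} B = {b_value:4.1f} A^2  ->  rms u = {rms_displacement(b_value):.2f} A")
```

Under this assumption the loop atoms are displaced by roughly 0.65 Å on average, compared with roughly 0.44 Å for the rest of the structure.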

Active Site

The molecular surface representation of the planar face of dGMII reveals an extended pocket 
in the N-terminal α/β-domain, formed primarily by acidic residues (Figure 9B). These same
residues form the core of a large, contiguous, surface-exposed patch, of highly conserved 
amino acids, in comparison with the human GMII sequence (Figure 9C). The active site of the 
enzyme is located in a small cavity in the side of this conserved, negatively charged region. 
The cavity is lined by aromatic residues Trp-95, Phe-206, Tyr-269 and Tyr-727, which are 
involved in hydrophobic and hydrogen-bond interactions with a bound Tris molecule in the 
unliganded structure (Figure 10A). Tris is known to inhibit dGMII activity (Rabouille et al., 
1999). Additional hydrophobic and hydrogen bond interactions are observed with Asp-92 and 
Asp-204. At the open side of the cavity the Tris molecule hydrogen bonds with Arg-228, Tyr- 
269 and Asp-341 (not shown) via water molecules. 

A key feature of the active site is the coordination of the zinc ion by the Tris hydroxyl group
O2. In the enzyme-Tris complex the zinc ion is bound in a T5 square-based pyramidal
geometry, coordinated by the OD1 oxygen moieties of aspartate residues 92 and 204; the NE2
nitrogens of histidines 90 and 471; and the hydroxyl oxygen O2 of the bound Tris molecule,
as represented in Figure 10A. The T5 geometry is further stabilized by hydrogen bonds
between the zinc-coordinating atoms and by H-bonds between the ND1 nitrogen
atoms of histidines 90 and 471 and the carbonyl oxygen of seleno-methionine 167 and a
water molecule, respectively (not shown). The presence of these so-called 'elec-His-Zn
motifs' is believed to increase the basicity and the ligand strength of the histidine and arrange
it correctly for interaction with the metal (Alberts et al., 1998). In an uninhibited enzyme, Tris
would likely be replaced by a coordinating water molecule. As discussed below, this
arrangement has implications for substrate binding and transition state stabilization.

The occurrence of zinc in Family 38 glycosyl hydrolases has been described by Snaith (1975)
in Jack-bean α-mannosidase. A possible role for zinc in catalysis was indicated by
inactivation of the enzyme by chelating agents and bivalent metal ions such as Cu2+. Copper
has also been shown to effectively inactivate Drosophila and mouse GMII (Rabouille et al.,
1999).

Inhibitor Binding 

The structures of dGMII in complex with the inhibitors DMNJ and swainsonine show that
both compounds bind to the same active site in a similar manner (Figure 10B and C). The
binding of both inhibitors involves a large contribution of hydrophobic interactions involving
aromatic residues Trp-95, Phe-206 and Tyr-727, forming the walls of the cavity. The inhibitor
ring structures are stacked against Trp-95, a feature seen in several carbohydrate binding and
hydrolyzing proteins (see Boraston et al., 2000 and review papers therein), and stabilized by
hydrogen bonds and interactions with the zinc ion. In the complexes of dGMII with either
DMNJ or swainsonine the T5 geometry of the bound zinc ion, as seen in the Tris-bound
enzyme, is transformed into T6 octahedral coordination. In both dGMII complexes the
inhibitor O2 hydroxyl oxygen replaces the O2 oxygen of Tris and the O3 hydroxyl oxygen
forms the apex of the second pyramid. In order to obey the restraints of the T6 geometry, the
plane of the swainsonine ring structure is tilted with respect to the saccharide-like ring of the
bound DMNJ molecule. This enables the formation of a hydrogen bond between the
zinc-coordinating OD1 oxygen of Asp-204 and the N4 nitrogen at the fusion of the five- and
six-membered rings of swainsonine. As in the Tris-bound enzyme, the zinc-coordinating oxygen
atoms of the inhibitors are involved in hydrogen bond interactions with the neighboring metal
binding residues of the enzyme.

The position of the DMNJ and swainsonine molecules is stabilized in the active site by
hydrogen bonds between carboxylic oxygens OD1 and OD2 of residue Asp-472 and hydroxyl
oxygens O3 and O4 (O5 in swainsonine) of the inhibitors, analogous to the O1 and O2
interactions seen in the enzyme-Tris complex. As in the Tris-bound enzyme, DMNJ is
involved in additional hydrogen bonds, via water molecules, with the NH2 nitrogen of
Arg-228, the hydroxyl oxygen of Tyr-269, the backbone carbonyl oxygen of Arg-876 (not shown)
and the OD1 oxygen of Asp-204.



The displacement of the Tris molecule by either of the inhibitors only slightly affects the zinc
binding site by weakening the internal hydrogen bonds between Asp-204 and histidines 90
and 471. No major conformational changes are observed between the Tris-bound and the
inhibitor-bound mannosidase molecules, as their backbones are virtually superimposable, with
root-mean-square deviations between Cα atoms of 0.068 Å (dGMII-DMNJ complex) and
0.087 Å (dGMII-swainsonine complex).
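
For reference, the root-mean-square deviation between two sets of equivalent Cα positions can be computed as sketched below. The sketch assumes the two structures have already been superimposed and that the atoms are listed in the same order; the function name and the coordinates are hypothetical and do not come from the dGMII models.

```python
import math
from typing import Sequence, Tuple

Coordinate = Tuple[float, float, float]


def rmsd(coords_a: Sequence[Coordinate], coords_b: Sequence[Coordinate]) -> float:
    """RMSD in angstroms between paired atoms of two pre-superimposed structures."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))


if __name__ == "__main__":
    # Toy coordinates for illustration only
    reference = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
    shifted = [(x + 0.1, y, z) for x, y, z in reference]
    print(f"RMSD = {rmsd(reference, shifted):.3f} A")
```

Values below 0.1 Å, such as those reported above for the two inhibitor complexes, indicate that the backbones are essentially identical within coordinate error.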

Catalytic mechanism 

Golgi α-mannosidase II is a retaining mannosyl hydrolase, which cleaves the linkage between
the C1 atom of M7 and M6 (Figure 8A) and, respectively, the O3 and O6 atom of the
α1,6-linked mannosyl branch (M4) of GlcNAcMan5GlcNAc2. The catalytic mechanism is proposed
to follow a very similar path to that of the corresponding retaining β-glycosidases (Braun et al.,
1995; White and Rose, 1997). This is a two-stage reaction that usually involves two
carboxylic acids, one acting as a nucleophile attacking the glycosidic bond, and the other as a
general acid/base catalyst. Nucleophilic attack of one carboxylic acid results in glycosylation
of the enzyme by forming a covalent intermediate, followed by a second deglycosylation step,
each step passing through an oxocarbonium ion-like transition state.

Based on the structure of the dGMII-inhibitor complexes we speculate that the mannose
residues on the α1,6-linked mannosyl branch (M4) bind to the enzyme at the same site and in
the same manner as the mannose-like inhibitor DMNJ. Coordination of the zinc ion with the O2
and O3 hydroxyl oxygens thereby contributes to the enzyme's specificity for mannose. Four
acidic amino acid residues, Asp-92, Asp-204, Asp-341 and Asp-472, are candidates for
catalytic side chains based on their proximity to the active site (Figure 10C). Results from a
recent study on the mechanism of catalysis in Jack-bean α-mannosidase by Withers and
co-workers, using reagents that trap the glycosyl-enzyme intermediate, identified an aspartate
residue as the catalytic nucleophile in that enzyme (Howard et al., 1998). Comparison of the
highly conserved sequence region surrounding this aspartate in Jack-bean α-mannosidase
with the same sequence region in dGMII suggests that aspartate residue 204 in dGMII is the
catalytic nucleophile that attacks the glycosidic linkage. For this reaction it is required that
Asp-204 is close to the anomeric carbon of the mannose substrate. In the dGMII-DMNJ
complex, however, the equivalent anomeric carbon is located 4.6 Å from the nucleophile.
Binding of the C2 and C3 substituent hydroxyl oxygens of the flattened five-membered ring
in swainsonine causes the inhibitor molecule to tilt, bringing its bridgehead nitrogen N4, in
the analogous position to C1 in the substrate, significantly closer to the putative nucleophilic
Asp-204 (3.2 Å). This tilted binding mode, stabilized by a hydrogen bond between N4 and
Asp-204 and by van der Waals stacking interactions between the 6-membered ring of
swainsonine and Phe-206, may resemble the mode of binding of the ring-flattened transition
state mannosyl cation. Thus, Phe-206 would stabilize the transition state by compensating for
the loss of stacking interactions of the substrate with Trp-95. The highly complementary
shape of swainsonine with the active site of dGMII, and its structural analogy with the skewed
boat transition state conformation, could therefore explain its 10,000 times higher binding
affinity for the enzyme, compared to the substrate-mimic DMNJ (data not shown).

15 The OD1 oxygen of Asp-204, the putative nucleophile, directly coordinates the zinc ion, 
implicating a role for the zinc in positioning the nucleophile and in the stabilization of 
protonation states of the reacting partners. It is tempting to speculate that the change of zinc 
coordination from T5 to the less favored T6 state (Alberts et al., 1998) on substrate binding 
may also contribute to the mechanism. From the Tris and DMNJ structures, it is predicted that 

20 the coordination would revert to T5 on product release. If so, this transition may energetically 
facilitate the deglycosylation step. Such evidence of direct zinc involvement in the catalytic 
mechanism of a glycosyl hydrolase is unprecedented. Arg-288 positions Asp-204 for 
nucleophilic attack by virtue of hydrogen bond interactions between its NE and NH2 
nitrogens and the OD2 oxygen of Asp-204 (Figure 10C). Based on the expected distance 

25 between the two catalytic residues (-5.5 A, Davies and Henrissat, 1995) likely candidates for 
the catalytic base are Asp-341 and Asp-472 (preliminary indications are that the D341N 
mutant is catalytically inactive, DAK unpublished results). Recent data suggest that other 
residues, such as tyrosines, possibly play a role in glycosidic bond cleavage (Davies and 
Henrissat, 1995). Tyrosine residues 269 are 727 are positioned to help stabilize the transition 

30 state. 
Substrate Binding and Cleavage

The function of GMII is dependent on the presence of β1,2-GlcNAc (G3, Figure 8A), added
to the α1,3-linked mannose (M5) by GlcNAc transferase I (see reviews: Kornfeld and Kornfeld,
1985; Moremen et al., 1994). This β1,2-GlcNAc dependence suggests the presence of an
additional saccharide-binding site in GMII. Evidence for such a binding site is provided by
the observation of an MPD molecule in the structure of dGMII, in the vicinity of the active
site cavity. MPD was used as a cryo-protectant when the crystal was flash-frozen
prior to data collection (see experimental procedures). Replacing MPD with
the alternative cryo-protectant glycerol resulted in the occupation of this same position by a
glycerol molecule. Glycerol has been shown to mimic saccharide binding in structures of
glycosyl hydrolases (Schmidt et al., 1998; Vallee et al., 2000).

The observation of MPD and glycerol binding near dGMII's active site (Figure 11A)
enables a hypothesis regarding the binding and cleavage of the α1,6- and α1,3-linked mannoses
on the α1,6-linked mannose branch of the GlcNAcMan5GlcNAc2 oligosaccharide. In this
hypothesis, the MPD binding site is suggested to be the putative site of interaction for
β1,2-GlcNAc (G3, Figure 8A), enabling anchoring of the oligosaccharide substrate in the
conserved negatively charged pocket. Figure 11B shows a model of a
GlcNAcMan5GlcNAc2 structure with the β1,2-GlcNAc residue placed in the MPD binding
site and the α1,6-linked M6 mannose docked into the active site, with its hydroxyl oxygens
O2 and O3 coordinating the zinc ion. As required, the asparagine-linked β1,4-GlcNAc
residues G1 and G2 extend away from the surface of the molecule (into the Golgi lumen).
Both M4 and the second substrate, the α1,3-linked M7 mannose, are located within the conserved
negatively charged pocket, pointing away from the active site cavity. In this orientation it can
be easily visualized that, after cleavage of the α1,6-linked M6, the second, α1,3-linked M7 can
be brought into the active site cavity by a ~180° rotation, through the extended pocket, around
the flexible α1,6-linkage of M4 (see Figure 11C). In addition to explaining the dependence of
GMII's action on the presence of the G3 β1,2-GlcNAc, this model provides a mechanism for the
cleavage of both mannose residues without major conformational change of the enzyme and,
more importantly, without release of the polypeptide-carbohydrate complex, anchored by the
stationary GlcNAc, between the two cleavage events. Finally, this model suggests that the
α1,6-linked M6 mannose is preferentially cleaved first, enabling the shorter α1,3-linked M7
residue to rotate through the pocket with minimal steric hindrance; according to our model,
the proposed 'swivel' mechanism would be slightly hampered should the M7 mannose be
cleaved first. This is supported by data reported for α-mannosidase II from mung bean
seedlings, Xenopus liver and rat liver Golgi, and for enzyme activity in homogenates of insect
cells, showing preferential hydrolytic activity on the M6 mannosyl residue (Kaushal et al.,
1990; Altmann and Martz, 1995; Ren et al., 1997).
Conclusions

The structure of the catalytic domain of Golgi α-mannosidase II provides the basis for its zinc
ion mediated specificity for mannose, as well as insight into its reaction mechanism. In
addition, the result illustrates the structural basis for the mechanism of inhibition by the
anti-cancer agent swainsonine, which we propose mimics aspects of the transition state binding.
This understanding is critical for the rational design of swainsonine variants and/or novel
mechanism-based compounds as specific α-mannosidase II inhibitors, for the treatment of
several forms of cancer. A bound MPD molecule identifies a putative GlcNAc binding
pocket located near the active site, and enables a hypothesis explaining the enzyme's
dependency on the single GlcNAc substitution of the GlcNAcMan5GlcNAc2 substrate for
binding. Furthermore, it suggests a novel mechanism for successive hydrolysis of the α1,6-
and α1,3-linked mannose residues, resulting in the tri-mannose core glycosyl structure.
Finally, it opens the door to the design of novel, highly specific inhibitors linking together
functional sites in the enzyme.

Various modifications and variations of the described methods and system of the invention
will be apparent to those skilled in the art without departing from the scope and spirit of the
invention. Although the invention has been described in connection with specific preferred
embodiments, it should be understood that the invention as claimed should not be unduly
limited to such specific embodiments. Indeed, various modifications of the described modes
for carrying out the invention which are obvious to those skilled in chemistry or biology or
related fields are intended to be covered by the present invention. All publications mentioned
in the above specification are herein incorporated by reference.
Table 1

Structural coordinates of a Drosophila Golgi α-mannosidase II.

REMARK coordinates from simulated annealing refinement
REMARK refinement resolution: 500.0 - 1.4 A
REMARK starting r= 0.1816 free_r= 0.2003
REMARK final r= 0.1894 free_r= 0.2063
REMARK rmsd bonds= 0.004594 rmsd angles= 1.32379
REMARK wa_initial= 0.264577 wa_dynamics= 0.28954 wa_final= 0.28836
REMARK target= mlf md-method= torsion annealing schedule= slowcool
REMARK starting temperature= 1000 total md steps= 40 * 6
REMARK sg= P2(1)2(1)2(1) a= 68.865 b= 109.718 c= 138.599 alpha= 90 beta= 90 gamma= 90
REMARK parameter file 1 : CNS_TOPPAR:protein_rep.param
REMARK parameter file 2 : CNS_TOPPAR:water_rep.param
REMARK parameter file 3 : CNS_TOPPAR:ion.param
REMARK parameter file 4 : trs.par
REMARK parameter file 5 : mpd.par
REMARK parameter file 6 : cis_peptide.param
REMARK parameter file 7 : CNS_TOPPAR:carbohydrate.param
REMARK molecular structure file: dgmllcgen.mtf
REMARK input coordinates: dgmllcgen.pdb
REMARK reflection file= ../semetHiR.cv
REMARK ncs= none
REMARK B-correction resolution: 6.0 - 1.4
REMARK initial B-factor correction applied to fobs :
REMARK B11= 0.609 B22= -0.765 B33= 0.155
REMARK B12= 0.000 B13= 0.000 B23= 0.000
REMARK B-factor correction applied to coordinate array B: 0.042
REMARK bulk solvent: density level= 0.35999 e/A^3, B-factor= 42.8385 A^2
REMARK reflections with |Fobs|/sigma_F < 0.0 rejected
REMARK reflections with |Fobs| > 10000 * rms(Fobs) rejected
REMARK theoretical total number of refl. in resol. range: 206243 (100.0%)
REMARK number of unobserved reflections (no entry or |F|=0): 59797 (29.0%)
REMARK number of reflections rejected: 0 (0.0%)
REMARK total number of reflections used: 146446 (71.0%)
REMARK number of reflections in working set: 139067 (67.4%)
REMARK number of reflections in test set: 7379 (3.6%)
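
The reflection counts reported in the header above are linked by simple bookkeeping: used = theoretical - unobserved - rejected, and used = working set + test set, with each percentage taken relative to the theoretical total. The following sketch re-derives the quoted figures as a consistency check; the function name and dictionary keys are hypothetical and are not part of the refinement program's output.

```python
def reflection_summary(theoretical: int, unobserved: int, rejected: int,
                       working: int, test: int) -> dict:
    """Cross-check reflection counts and express them as % of the theoretical total."""
    used = theoretical - unobserved - rejected
    if working + test != used:
        raise ValueError("working + test set does not equal the reflections used")

    def pct(count: int) -> float:
        return round(100.0 * count / theoretical, 1)

    return {
        "used": used,
        "unobserved_pct": pct(unobserved),
        "used_pct": pct(used),
        "working_pct": pct(working),
        "test_pct": pct(test),
    }


# Counts copied from the Table 1 header.
TABLE1_SUMMARY = reflection_summary(
    theoretical=206243, unobserved=59797, rejected=0, working=139067, test=7379
)

if __name__ == "__main__":
    for key, value in TABLE1_SUMMARY.items():
        print(f"{key}: {value}")
```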
ATOM 8483 OH2 WAT W 309 43.300 80.722 5.014 1.00 22.89 W

ATOM 8484 OH2 WAT W 310 32.873 85.309 -5.565 1.00 20.83 W 

ATOM 8485 OH2 WAT W 311 31.185 33.495 2.792 1.00 24.76 W 

ATOM 8486 OH2 WAT W 312 21.436 78.463 -1.341 1.00 20.14 W 

ATOM 8487 OH2 WAT W 313 55.286 59.005 3.952 1.00 16.44 W

ATOM 8488 OH2 WAT W 314 13.406 62.292 -19.877 1.00 20.13 W 

ATOM 8489 OH2 WAT W 315 16.922 50.824 -19.239 1.00 24.71 W 

ATOM 8490 OH2 WAT W 316 14.323 62.818 -12.022 1.00 20.05 W 

ATOM 8491 OH2 WAT W 317 53.131 86.670 -21.375 1.00 24.00 W 

ATOM 8492 OH2 WAT W 318 70.984 49.511 -0.110 1.00 23.19 W

ATOM 8493 OH2 WAT W 319 58.047 42.402 1.566 1.00 23.65 W 

ATOM 8494 OH2 WAT W 320 74.768 76.369 -14.340 1.00 27.79 W 

ATOM 8495 OH2 WAT W 321 24.787 83.128 -16.877 1.00 20.99 W 

ATOM 8496 OH2 WAT W 322 56.915 67.700 -35.396 1.00 21.88 W 

ATOM 8497 OH2 WAT W 323 48.769 50.559 -29.520 1.00 24.35 W

ATOM 8498 OH2 WAT W 325 66.709 71.607 -6.852 1.00 19.70 W 

ATOM 8499 OH2 WAT W 326 28.790 93.746 -38.512 1.00 23.49 W 

ATOM 8500 OH2 WAT W 327 32.509 64.768 -9.926 1.00 17.49 W 

ATOM 8501 OH2 WAT W 328 13.454 50.922 -1.603 1.00 22.73 W 

ATOM 8502 OH2 WAT W 329 39.346 49.244 30.155 1.00 19.73 W

ATOM 8503 OH2 WAT W 330 67.888 58.663 -24.054 1.00 19.93 W 

ATOM 8504 OH2 WAT W 331 20.779 58.985 -34.524 1.00 18.17 W 

ATOM 8505 OH2 WAT W 332 37.550 75.641 -42.326 1.00 19.39 W 

ATOM 8506 OH2 WAT W 333 31.895 51.294 -35.392 1.00 20.49 W 

ATOM 8507 OH2 WAT W 334 12.860 71.207 -0.322 1.00 26.50 W

ATOM 8508 OH2 WAT W 335 37.017 41.136 -24.664 1.00 26.87 W 

ATOM 8509 OH2 WAT W 336 19.245 42.748 -2.573 1.00 24.91 W 

ATOM 8510 OH2 WAT W 337 50.102 44.093 7.091 1.00 17.71 W 

ATOM 8511 OH2 WAT W 338 37.447 45.431 -25.539 1.00 18.16 W 

ATOM 8512 OH2 WAT W 339 75.300 74.056 -19.617 1.00 26.79 W

ATOM 8513 OH2 WAT W 340 50.344 61.083 -31.161 1.00 19.20 W 

ATOM 8514 OH2 WAT W 341 42.877 56.717 -31.559 1.00 24.68 W 

ATOM 8515 OH2 WAT W 342 13.262 68.814 0.935 1.00 21.56 W 

ATOM 8516 OH2 WAT W 343 40.735 82.850 -31.321 1.00 26.31 W 

ATOM 8517 OH2 WAT W 344 27.084 52.281 22.512 1.00 28.70 W

ATOM 8518 OH2 WAT W 345 12.534 66.379 -9.439 1.00 20.41 W 

ATOM 8519 OH2 WAT W 346 42.619 39.443 -15.769 1.00 22.12 W 

ATOM 8520 OH2 WAT W 347 48.091 48.434 16.393 1.00 17.55 W 

ATOM 8521 OH2 WAT W 348 11.547 46.877 17.887 1.00 24.28 W 

ATOM 8522 OH2 WAT W 349 16.474 67.738 -19.969 1.00 19.28 W

ATOM 8523 OH2 WAT W 350 5.038 52.914 -5.886 1.00 27.53 W 

ATOM 8524 OH2 WAT W 352 56.273 90.997 -25.706 1.00 22.05 W 

ATOM 8525 OH2 WAT W 353 39.139 83.260 -27.376 1.00 21.38 W 

ATOM 8526 OH2 WAT W 354 60.719 56.850 15.845 1.00 26.22 W 

ATOM 8527 OH2 WAT W 355 13.419 51.471 21.730 1.00 19.40 W

ATOM 8528 OH2 WAT W 356 43.394 93.839 -37.655 1.00 21.87 W 

ATOM 8529 OH2 WAT W 357 28.161 35.317 -6.921 1.00 20.76 W 

ATOM 8530 OH2 WAT W 358 80.258 64.057 -21.537 1.00 25.16 W 

ATOM 8531 OH2 WAT W 359 51.563 45.062 9.678 1.00 28.77 W 

ATOM 8532 OH2 WAT W 360 21.446 46.840 -17.598 1.00 23.85 W

ATOM 8533 OH2 WAT W 361 47.431 71.776 -20.273 1.00 25.46 W 

ATOM 8534 OH2 WAT W 362 21.320 85.819 -31.086 1.00 28.86 W 

ATOM 8535 OH2 WAT W 363 32.967 66.868 21.309 1.00 21.01 W 

ATOM 8536 OH2 WAT W 364 14.670 61.989 -14.512 1.00 20.46 W 

ATOM 8537 OH2 WAT W 365 50.391 73.726 -25.274 1.00 22.28 W
ATOM 8648 OH2 WAT W 478 70.032 61.711 -20.342 1.00 28.95 W

ATOM 8649 OH2 WAT W 479 23.576 90.884 -26.085 1.00 32.69 W 

ATOM 8650 OH2 WAT W 480 25.257 71.144 22.415 1.00 28.48 W 

ATOM 8651 OH2 WAT W 481 14.011 69.331 26.764 1.00 25.72 W 

ATOM 8652 OH2 WAT W 482 62.091 80.708 -18.944 1.00 22.51 W

ATOM 8653 OH2 WAT W 483 27.568 47.649 45.829 1.00 30.99 W 

ATOM 8654 OH2 WAT W 484 24.924 34.183 0.541 1.00 31.69 W 

ATOM 8655 OH2 WAT W 485 57.542 69.040 13.372 1.00 35.06 W 

ATOM 8656 OH2 WAT W 486 7.964 48.092 11.826 1.00 24.78 W 

ATOM 8657 OH2 WAT W 487 71.310 59.790 1.011 1.00 30.00 W

ATOM 8658 OH2 WAT W 488 67.619 88.039 -23.404 1.00 24.62 W 

ATOM 8659 OH2 WAT W 489 12.380 75.159 -13.498 1.00 29.76 W 

ATOM 8660 OH2 WAT W 490 27.878 52.141 42.550 1.00 28.52 W 

ATOM 8661 OH2 WAT W 491 22.024 62.686 33.804 1.00 29.60 W 

ATOM 8662 OH2 WAT W 492 59.396 66.922 -26.287 1.00 39.41 W

ATOM 8663 OH2 WAT W 493 34.320 87.124 -10.560 1.00 30.89 W 

ATOM 8664 OH2 WAT W 494 30.216 36.445 29.080 1.00 25.09 W 

ATOM 8665 OH2 WAT W 495 8.571 49.568 5.657 1.00 35.32 W 

ATOM 8666 OH2 WAT W 498 41.955 67.344 29.326 1.00 25.76 W 

ATOM 8667 OH2 WAT W 499 72.402 75.774 -27.267 1.00 28.93 W

ATOM 8668 OH2 WAT W 500 8.848 44.348 12.960 1.00 29.54 W 

ATOM 8669 OH2 WAT W 501 71.065 43.044 -15.336 1.00 30.91 W 

ATOM 8670 OH2 WAT W 502 41.462 44.667 -24.401 1.00 26.41 W 

ATOM 8671 OH2 WAT W 503 39.798 66.983 31.423 1.00 27.70 W 

ATOM 8672 OH2 WAT W 504 53.321 52.719 26.356 1.00 25.71 W

ATOM 8673 OH2 WAT W 505 26.304 44.377 -27.111 1.00 28.70 W 

ATOM 8674 OH2 WAT W 506 19.649 52.518 -27.061 1.00 26.07 W 

ATOM 8675 OH2 WAT W 507 70.823 95.128 -34.949 1.00 33.07 W

ATOM 8676 OH2 WAT W 508 48.072 64.729 -25.801 1.00 25.60 W 

ATOM 8677 OH2 WAT W 509 44.452 64.662 -35.219 1.00 35.32 W

ATOM 8678 OH2 WAT W 510 18.126 52.856 35.976 1.00 29.35 W 

ATOM 8679 OH2 WAT W 511 83.591 70.051 -20.249 1.00 24.79 W 

ATOM 8680 OH2 WAT W 512 43.836 79.945 -42.480 1.00 26.76 W 

ATOM 8681 OH2 WAT W 513 22.688 62.534 16.084 1.00 24.30 W 

ATOM 8682 OH2 WAT W 514 45.760 79.249 -7.906 1.00 24.96 W

ATOM 8683 OH2 WAT W 515 26.801 88.079 -4.103 1.00 32.00 W 

ATOM 8684 OH2 WAT W 516 38.743 42.182 -22.199 1.00 30.07 W 

ATOM 8685 OH2 WAT W 517 36.884 29.145 13.084 1.00 31.77 W 

ATOM 8686 OH2 WAT W 518 20.410 65.507 27.349 1.00 30.30 W 

ATOM 8687 OH2 WAT W 519 6.729 59.284 2.971 1.00 32.47 W

ATOM 8688 OH2 WAT W 520 61.217 77.418 -12.932 1.00 22.54 W 

ATOM 8689 OH2 WAT W 521 28.095 49.290 38.335 1.00 25.33 W 

ATOM 8690 OH2 WAT W 522 29.442 41.131 -25.576 1.00 24.39 W 

ATOM 8691 OH2 WAT W 523 19.537 67.313 -2.706 1.00 23.69 W 

ATOM 8692 OH2 WAT W 524 43.446 41.720 -7.635 1.00 26.49 W

ATOM 8693 OH2 WAT W 525 48.643 55.026 -22.750 1.00 19.36 W 

ATOM 8694 OH2 WAT W 526 26.385 81.719 -3.635 1.00 41.05 W 

ATOM 8695 OH2 WAT W 527 35.767 39.049 -16.111 1.00 28.81 W 

ATOM 8696 OH2 WAT W 528 58.571 51.181 22.159 1.00 21.40 W 

ATOM 8697 OH2 WAT W 529 54.235 45.937 4.993 1.00 33.49 W

ATOM 8698 OH2 WAT W 530 46.524 80.242 -42.585 1.00 25.30 W 

ATOM 8699 OH2 WAT W 531 71.643 84.509 -20.062 1.00 25.12 W 

ATOM 8700 OH2 WAT W 532 61.128 67.499 -36.724 1.00 28.07 W 

ATOM 8701 OH2 WAT W 533 37.969 31.489 16.453 1.00 39.22 W 

ATOM 8702 OH2 WAT W 534 58.573 34.864 -12.260 1.00 28.74 W
ATOM 8813 OH2 WAT W 653 39.252 43.518 26.888 1.00 28.02 W

ATOM 8814 OH2 WAT W 654 11.931 70.822 2.485 1.00 31.13 W 

ATOM 8815 OH2 WAT W 655 36.424 62.574 -38.251 1.00 29.44 W 

ATOM 8816 OH2 WAT W 656 76.055 83.157 -25.276 1.00 39.00 W 

ATOM 8817 OH2 WAT W 657 37.608 80.454 -0.581 1.00 23.73 W

ATOM 8818 OH2 WAT W 658 47.350 53.378 35.765 1.00 33.39 W 

ATOM 8819 OH2 WAT W 659 48.604 97.804 -33.661 1.00 29.32 W 

ATOM 8820 OH2 WAT W 660 63.595 75.084 -45.655 1.00 33.19 W 

ATOM 8821 OH2 WAT W 661 60.081 71.509 0.342 1.00 33.30 W 

ATOM 8822 OH2 WAT W 662 41.130 48.174 -29.424 1.00 33.56 W

ATOM 8823 OH2 WAT W 663 55.384 40.246 -12.570 1.00 43.02 W 

ATOM 8824 OH2 WAT W 664 74.977 83.935 -22.956 1.00 33.05 W 

ATOM 8825 OH2 WAT W 665 63.406 52.047 -0.358 1.00 26.92 W 

ATOM 8826 OH2 WAT W 666 37.221 36.481 18.964 1.00 39.54 W 

ATOM 8827 OH2 WAT W 667 57.641 36.009 -2.585 1.00 36.41 W

ATOM 8828 OH2 WAT W 668 23.115 48.814 -36.297 1.00 32.39 W 

ATOM 8829 OH2 WAT W 669 17.636 71.546 -29.530 1.00 35.34 W 

ATOM 8830 OH2 WAT W 670 30.674 40.378 38.143 1.00 36.19 W 

ATOM 8831 OH2 WAT W 671 27.433 50.036 23.323 1.00 31.31 W 

ATOM 8832 OH2 WAT W 672 30.557 48.124 40.431 1.00 36.97 W

ATOM 8833 OH2 WAT W 673 59.855 52.908 -30.690 1.00 30.00 W 

ATOM 8834 OH2 WAT W 674 44.853 40.592 13.345 1.00 29.68 W 

ATOM 8835 OH2 WAT W 675 34.115 47.984 36.057 1.00 30.03 W 

ATOM 8836 OH2 WAT W 676 27.358 82.247 3.089 1.00 28.86 W 

ATOM 8837 OH2 WAT W 677 58.497 92.398 -42.235 1.00 35.60 W

ATOM 8838 OH2 WAT W 678 49.253 52.427 -37.624 1.00 38.08 W 

ATOM 8839 OH2 WAT W 679 53.379 70.899 -16.924 1.00 25.65 W 

ATOM 8840 OH2 WAT W 680 57.626 62.695 -15.503 1.00 35.96 W 

ATOM 8841 OH2 WAT W 681 28.297 44.350 -29.713 1.00 28.58 W 

ATOM 8842 OH2 WAT W 682 48.852 49.432 -38.154 1.00 36.59 W

ATOM 8843 OH2 WAT W 683 35.576 77.511 -43.062 1.00 29.02 W 

ATOM 8844 OH2 WAT W 684 9.623 58.535 17.852 1.00 33.51 W 

ATOM 8845 OH2 WAT W 686 61.093 45.523 -3.654 1.00 29.72 W 

ATOM 8846 OH2 WAT W 687 65.483 78.684 -5.700 1.00 27.73 W 

ATOM 8847 OH2 WAT W 688 7.036 58.084 14.815 1.00 36.18 W

ATOM 8848 OH2 WAT W 689 49.432 50.626 32.574 1.00 27.09 W 

ATOM 8849 OH2 WAT W 690 18.651 62.429 22.834 1.00 42.61 W 

ATOM 8850 OH2 WAT W 691 30.258 34.015 21.108 1.00 34.14 W 

ATOM 8851 OH2 WAT W 692 17.405 45.631 -14.008 1.00 26.24 W 

ATOM 8852 OH2 WAT W 693 28.725 55.316 39.985 1.00 27.55 W

ATOM 8853 OH2 WAT W 700 50.875 59.946 -8.928 1.00 11.46 W 

ATOM 8854 OH2 WAT W 701 43.729 63.036 -28.635 1.00 10.90 W 

ATOM 8855 OH2 WAT W 702 38.088 81.789 -31.356 1.00 11.61 W 

ATOM 8856 OH2 WAT W 703 32.349 39.151 -4.925 1.00 10.69 W 

ATOM 8857 OH2 WAT W 705 60.345 60.974 -18.133 1.00 10.00 W

ATOM 8858 OH2 WAT W 706 63.195 58.590 -1.434 1.00 14.50 W 

ATOM 8859 OH2 WAT W 707 60.694 60.822 -3.596 1.00 12.04 W 

ATOM 8860 OH2 WAT W 708 46.601 74.399 7.782 1.00 12.30 W 

ATOM 8861 OH2 WAT W 709 54.966 57.537 -0.483 1.00 13.73 W 

ATOM 8862 OH2 WAT W 710 30.225 74.877 -29.275 1.00 13.65 W

ATOM 8863 OH2 WAT W 712 58.878 56.724 -1.422 1.00 11.48 W 

ATOM 8864 OH2 WAT W 713 58.700 52.597 -0.079 1.00 14.33 W 

ATOM 8865 OH2 WAT W 714 63.107 61.088 -4.839 1.00 12.89 W 

ATOM 8866 OH2 WAT W 716 27.955 48.693 12.047 1.00 17.48 W 

ATOM 8867 OH2 WAT W 717 66.623 56.169 -20.267 1.00 17.13 W
Table 2

Structural coordinates of a Drosophila Golgi α-mannosidase II with swainsonine.

REMARK coordinates from restrained individual B-factor refinement
REMARK refinement resolution: 500.0 - 1.87 A
REMARK starting r= 0.1835 free_r= 0.2089
REMARK final r= 0.1801 free_r= 0.2084
REMARK B rmsd for bonded mainchain atoms= 0.707 target= 1.5
REMARK B rmsd for bonded sidechain atoms= 1.139 target= 2.0
REMARK B rmsd for angle mainchain atoms= 1.167 target= 2.0
REMARK B rmsd for angle sidechain atoms= 1.765 target= 2.5
REMARK wa= 1.14241
REMARK rweight= 0.269445
REMARK target= mlf steps= 30
REMARK sg= P2(1)2(1)2(1) a= 68.902 b= 110.015 c= 138.472 alpha= 90 beta= 90 gamma= 90
REMARK parameter file 1 : CNS_TOPPAR:protein_rep.param
REMARK parameter file 2 : CNS_TOPPAR:water_rep.param
REMARK parameter file 3 : CNS_TOPPAR:ion.param
REMARK parameter file 4 : swainsonine2.par
REMARK parameter file 5 : ../zntrmp/mpd.par
REMARK parameter file 6 : cis_peptide.param
REMARK parameter file 7 : CNS_TOPPAR:carbohydrate.param
REMARK molecular structure file: swainsoninegen.mtf
REMARK input coordinates: swainsonine_ann_l.pdb
REMARK reflection file= dgm2native_rejmerge.cv
REMARK ncs= none
REMARK B-correction resolution: 6.0 - 1.87
REMARK initial B-factor correction applied to fobs :
REMARK B11= 0.513 B22= -0.085 B33= -0.428
REMARK B12= 0.000 B13= 0.000 B23= 0.000
REMARK B-factor correction applied to coordinate array B: -0.306
REMARK bulk solvent: density level= 0.353213 e/A^3, B-factor= 42.0423 A^2
REMARK reflections with |Fobs|/sigma_F < 0.0 rejected
REMARK reflections with |Fobs| > 10000 * rms(Fobs) rejected
REMARK theoretical total number of refl. in resol. range: 87643 (100.0%)
REMARK number of unobserved reflections (no entry or |F|=0): 2814 (3.2%)
REMARK number of reflections rejected: 0 (0.0%)
ATOM 4682 CG2 VAL A 595 49.071 86.551 -27.516 1.00 17.31 A

ATOM 4683 C VAL A 595 45.382 87.572 -27.895 1.00 16.73 A 

ATOM 4684 O VAL A 595 45.386 88.794 -27.765 1.00 17.05 A 

ATOM 4685 N TRP A 596 44.277 86.838 -27.809 1.00 16.70 A 

ATOM 4686 CA TRP A 596 42.979 87.441 -27.541 1.00 17.84 A

ATOM 4687 CB TRP A 596 41.890 86.779 -28.400 1.00 17.13 A 

ATOM 4688 CG TRP A 596 42.053 86.944 -29.878 1.00 17.49 A 

ATOM 4689 CD2 TRP A 596 41.740 88.111 -30.649 1.00 17.58 A 

ATOM 4690 CE2 TRP A 596 42.029 87.810 -31.999 1.00 17.37 A 

ATOM 4691 CE3 TRP A 596 41.243 89.383 -30.329 1.00 18.00 A

ATOM 4692 CD1 TRP A 596 42.509 86.010 -30.766 1.00 17.26 A 

ATOM 4693 NE1 TRP A 596 42.494 86.521 -32.042 1.00 17.27 A 

ATOM 4694 CZ2 TRP A 596 41.837 88.734 -33.031 1.00 18.13 A 

ATOM 4695 CZ3 TRP A 596 41.050 90.305 -31.359 1.00 18.36 A 

ATOM 4696 CH2 TRP A 596 41.348 89.974 -32.692 1.00 18.70 A

ATOM 4697 C TRP A 596 42.569 87.329 -26.077 1.00 18.84 A 

ATOM 4698 O TRP A 596 42.689 86.267 -25.466 1.00 17.93 A 

ATOM 4699 N SER A 597 42.084 88.434 -25.519 1.00 19.67 A

ATOM 4700 CA SER A 597 41.620 88.451 -24.137 1.00 21.40 A

ATOM 4701 CB SER A 597 42.561 89.280 -23.261 1.00 21.75 A

ATOM 4702 OG SER A 597 42.699 90.589 -23.774 1.00 24.13 A

ATOM 4703 C SER A 597 40.221 89.055 -24.127 1.00 21.80 A

ATOM 4704 O SER A 597 39.973 90.082 -24.764 1.00 22.44 A

ATOM 4705 N TRP A 598 39.306 88.411 -23.414 1.00 22.12 A 

ATOM 4706 CA TRP A 598 37.934 88.888 -23.343 1.00 23.28 A

ATOM 4707 CB TRP A 598 36.967 87.704 -23.289 1.00 21.46 A 

ATOM 4708 CG TRP A 598 36.940 86.928 -24.570 1.00 19.87 A 

ATOM 4709 CD2 TRP A 598 35.925 86.976 -25.577 1.00 19.13 A 

ATOM 4710 CE2 TRP A 598 36.334 86.122 -26.627 1.00 18.71 A 

ATOM 4711 CE3 TRP A 598 34.708 87.662 -25.698 1.00 19.08 A

ATOM 4712 CD1 TRP A 598 37.898 86.070 -25.033 1.00 19.36 A 

ATOM 4713 NE1 TRP A 598 37.540 85.581 -26.269 1.00 18.19 A 

ATOM 4714 CZ2 TRP A 598 35.569 85.934 -27.782 1.00 18.54 A 

ATOM 4715 CZ3 TRP A 598 33.947 87.475 -26.849 1.00 18.32 A 

ATOM 4716 CH2 TRP A 598 34.382 86.617 -27.876 1.00 18.60 A

ATOM 4717 C TRP A 598 37.718 89.802 -22.149 1.00 25.15 A 

ATOM 4718 O TRP A 598 38.204 89.536 -21.051 1.00 25.24 A 

ATOM 4719 N HIS A 599 36.982 90.883 -22.374 1.00 27.74 A 

ATOM 4720 CA HIS A 599 36.728 91.848 -21.319 1.00 30.86 A 

ATOM 4721 CB HIS A 599 37.605 93.081 -21.542 1.00 32.48 A

ATOM 4722 CG HIS A 599 39.068 92.770 -21.604 1.00 34.54 A 

ATOM 4723 CD2 HIS A 599 39.934 92.763 -22.645 1.00 35.10 A 

ATOM 4724 ND1 HIS A 599 39.793 92.372 -20.500 1.00 35.25 A 

ATOM 4725 CE1 HIS A 599 41.042 92.135 -20.859 1.00 35.53 A 

ATOM 4726 NE2 HIS A 599 41.154 92.364 -22.156 1.00 35.52 A

ATOM 4727 C HIS A 599 35.271 92.268 -21.242 1.00 32.18 A 

ATOM 4728 O HIS A 599 34.606 92.441 -22.264 1.00 31.64 A 

ATOM 4729 N HIS A 600 34.779 92.420 -20.019 1.00 34.30 A 

ATOM 4730 CA HIS A 600 33.407 92.846 -19.803 1.00 36.52 A 

ATOM 4731 CB HIS A 600 32.895 92.352 -18.449 1.00 38.40 A

ATOM 4732 CG HIS A 600 31.437 92.609 -18.225 1.00 40.80 A 

ATOM 4733 CD2 HIS A 600 30.432 91.782 -17.852 1.00 41.76 A 

ATOM 4734 ND1 HIS A 600 30.868 93.855 -18.384 1.00 41.71 A 

ATOM 4735 CE1 HIS A 600 29.575 93.784 -18.120 1.00 42.34 A 

ATOM 4736 NE2 HIS A 600 29.285 92.537 -17.794 1.00 42.76 A
ATOM 9192 C2 MPD M 1 16.264 61.479 10.614 1.00 23.60

ATOM 9193 O2 MPD M 1 16.876 60.330 9.851 1.00 24.23

ATOM 9194 CM MPD M 1 17.075 62.611 10.000 1.00 24.36

ATOM 9195 C3 MPD M 1 16.492 61.271 12.125 1.00 23.19

ATOM 9196 C4 MPD M 1 17.813 60.945 12.649 1.00 23.12

ATOM 9197 O4 MPD M 1 17.580 59.936 13.627 1.00 22.00

ATOM 9198 C5 MPD M 1 18.337 62.222 13.387 1.00 22.61

ATOM 9199 ZN ZN Z 1 34.561 64.335 8.062 1.00 15.34

END
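
Coordinate records of the kind listed in Tables 1 and 2 can be read programmatically, for example to measure interatomic distances. The sketch below is a minimal illustration only: it assumes the whitespace-separated field order seen in the listings (record type, serial, atom name, residue name, segment, residue number, x, y, z, occupancy, B-factor), which is looser than the fixed-column PDB format, and the class and function names are hypothetical. The two sample records are the MPD O2 and zinc entries from Table 2.

```python
import math
from typing import NamedTuple


class Atom(NamedTuple):
    serial: int
    name: str
    resname: str
    segid: str
    resseq: int
    x: float
    y: float
    z: float
    occupancy: float
    bfactor: float


def parse_atom_line(line: str) -> Atom:
    """Parse one whitespace-separated ATOM record (assumed field order, see text)."""
    fields = line.split()
    if len(fields) < 11 or fields[0] != "ATOM":
        raise ValueError(f"not a whitespace-separated ATOM record: {line!r}")
    return Atom(
        serial=int(fields[1]),
        name=fields[2],
        resname=fields[3],
        segid=fields[4],
        resseq=int(fields[5]),
        x=float(fields[6]),
        y=float(fields[7]),
        z=float(fields[8]),
        occupancy=float(fields[9]),
        bfactor=float(fields[10]),
    )


def distance(a: Atom, b: Atom) -> float:
    """Euclidean distance between two atoms, in the units of the coordinates."""
    return math.dist((a.x, a.y, a.z), (b.x, b.y, b.z))


MPD_O2 = parse_atom_line("ATOM 9193 O2 MPD M 1 16.876 60.330 9.851 1.00 24.23")
ZINC = parse_atom_line("ATOM 9199 ZN ZN Z 1 34.561 64.335 8.062 1.00 15.34")

if __name__ == "__main__":
    print(f"MPD O2 to Zn: {distance(MPD_O2, ZINC):.2f}")
```

Records in which two numeric fields have run together (for example a coordinate immediately followed by a negative value) would need to be separated before this parser is applied.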
Table 3

Structural Results: Active Site

From initial analysis of the crystal structure, based on the location of residues known to be
involved in catalysis, the active site of the enzyme is located within a region composed of the
following residues:

GLY  60   VAL  61   TRP  62   LYS  63   GLN  64   GLY  65   ILE  68   VAL  83
PHE  84   VAL  85   VAL  86   PRO  87   HIS  88   SER  89   HIS  90   ASN  91
ASP  92   TRP  95   ILE 110   met 124   PHE 126   ILE 127   TRP 128   ala 129
GLU 130   VAL 161   MET 167   TRP 201   ALA 202   ILE 203   ASP 204   PRO 205
PHE 206   GLY 207   HIS 208   LEU 225   ILE 226   GLN 227   ARG 228   thr 229
TYR 231   LYS 234   LEU 243   HIS 262   MET 263   MET 264   PRO 265   PHE 266
TYR 267   SER 268   TYR 269   ASP 270   ILE 271   PRO 272   HIS 273   THR 274
CYS 275   GLY 276   PRO 277   ASP 278   PRO 279   LYS 280   val 281   CYS 282
CYS 283   GLN 284   PHE 285   ASP 286   PHE 287   LYS 288   ARG 289   met 290
phe 293   gly 294   leu 295   ser 296   CYS 297   PRO 298   TRP 299   lys 300
VAL 301   PRO 302   PRO 303   leu 317   LEU 318   GLN 321   trp 322   LYS 324
LYS 325   ala 326   LEU 328   tyr 329   LEU 334   LEU 335   ile 336   PRO 337
LEU 338   GLY 339   ASP 340   ASP 341   PHE 342   ARG 343   phe 344   lys 345
GLU 349   val 352   GLN 353   arg 354   TYR 357   LEU 360   PHE 361   PHE 376
GLY 377   LEU 379   TYR 407   ALA 408   ASP 409   ARG 410   trp 415   HIS 471
ASP 472   TYR 727   ASP 874   glu 875   ARG 876   GLY 877   LEU 878



Residues in lower case are not identical in the human Golgi ManII sequence, though many of
those are conservative substitutions. Within this sphere of 15 Å around the catalytic center, there
are only 21 non-identities among 121 residues, indicating that the human active site will be
essentially identical to that observed for the Drosophila structure.
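
The identity figure quoted above follows from simple arithmetic, and the upper/lower case convention of Table 3 lends itself to a mechanical count. The sketch below is illustrative only; the function names are hypothetical and the four-entry sample list is a small excerpt of Table 3 rather than the full residue set.

```python
from typing import Sequence


def percent_identity(total: int, non_identical: int) -> float:
    """Percentage of identical positions given the total and the number of differences."""
    if total <= 0 or not 0 <= non_identical <= total:
        raise ValueError("counts must satisfy 0 <= non_identical <= total and total > 0")
    return 100.0 * (total - non_identical) / total


def count_non_identical(labels: Sequence[str]) -> int:
    """Count entries whose residue name is written in lower case (Table 3 convention)."""
    return sum(1 for label in labels if label.split()[0].islower())


# Small excerpt of Table 3, used only to demonstrate the case convention.
SAMPLE = ["MET 167", "met 124", "ala 129", "GLU 130"]

if __name__ == "__main__":
    print(f"{percent_identity(121, 21):.1f}% identical")
    print(count_non_identical(SAMPLE), "non-identical residues in the sample")
```

With the counts stated in the text (21 differences among 121 residues), this gives about 82.6% identity within the region considered.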

The numbering system in this and the accompanying Figures corresponds to the protein
expressed in the system described herein, not to full-length ManII.
Table 4

Intermolecular Interactions at an Active Site of a Mannosidase II Swainsonine Complex.

No. of Atomic   Swainsonine      Enzyme           Distance Between      Atomic Interaction
Interaction     Atomic Contact   Atomic Contact   Atomic Contacts (Å)   Property

1               O1               His 471 NE2      3.3                   HB
2               N                Asp 204 OD1      2.9                   HB
3                                Asp 341 OD2      3.4                   HB
4                                His 90 NE2       3.1                   HB
5               O2               Asp 92 OD1       2.9                   HB
6               O8               Asp 472 OD1      2.53                  HB
7               Ring             Phe 206 ring     3.55                  VW
8               O8               Tyr 727 OH       2.6                   HB
9               Ring             Trp 95           3.7                   VW

HB: hydrogen bond interaction
VW: van der Waals interaction
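
A contact table of this kind can be held as structured data and filtered by interaction type or distance. The following sketch is a hedged illustration: the class and function names are hypothetical, distances are assumed to be in ångströms, row numbers 2 to 4 are inferred from their position in the table, and the swainsonine atom is left as None where the table does not give one.

```python
from typing import List, NamedTuple, Optional


class Contact(NamedTuple):
    number: int
    ligand_atom: Optional[str]  # None where the table gives no swainsonine atom
    enzyme_atom: str
    distance: float             # assumed to be in angstroms
    kind: str                   # "HB" = hydrogen bond, "VW" = van der Waals


CONTACTS: List[Contact] = [
    Contact(1, "O1", "His 471 NE2", 3.3, "HB"),
    Contact(2, "N", "Asp 204 OD1", 2.9, "HB"),
    Contact(3, None, "Asp 341 OD2", 3.4, "HB"),
    Contact(4, None, "His 90 NE2", 3.1, "HB"),
    Contact(5, "O2", "Asp 92 OD1", 2.9, "HB"),
    Contact(6, "O8", "Asp 472 OD1", 2.53, "HB"),
    Contact(7, "ring", "Phe 206 ring", 3.55, "VW"),
    Contact(8, "O8", "Tyr 727 OH", 2.6, "HB"),
    Contact(9, "ring", "Trp 95", 3.7, "VW"),
]


def contacts_within(cutoff: float, kind: Optional[str] = None) -> List[Contact]:
    """Return contacts no longer than `cutoff`, optionally restricted to one type."""
    return [
        c for c in CONTACTS
        if c.distance <= cutoff and (kind is None or c.kind == kind)
    ]


if __name__ == "__main__":
    for contact in contacts_within(3.0, "HB"):
        print(contact.number, contact.enzyme_atom, contact.distance)
```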
Table 5

Crystallographic Refinement Statistics for the Native Drosophila Golgi
Mannosidase II

>>> input coordinates: dgml2_ann_lbi.pdb
>>> molecular structure file: dgml2gen.mtf
>>> parameter file 1 : CNS_TOPPAR:protein_rep.param
>>> parameter file 2 : CNS_TOPPAR:water_rep.param
>>> parameter file 3 : CNS_TOPPAR:ion.param
>>> parameter file 4 : trs.par
>>> parameter file 5 : mpdnew2.par
>>> parameter file 6 : cis_peptide.param
>>> parameter file 7 : CNS_TOPPAR:carbohydrate.param
>>> reflection file= ../semetHiR.cv
>>> spacegroup: P2(1)2(1)2(1)
>>> cell dimensions: a= 68.865 b= 109.718 c= 138.599 alpha= 90 beta= 90 gamma= 90
>>> current wa= 0.311868 for target= mlf
>>> ncs= none
>>> initial B-factor correction applied to fobs :
>>> B11= 0.587 B22= -0.754 B33= 0.167
>>> B12= 0.000 B13= 0.000 B23= 0.000
>>> B-factor correction applied to coordinate array B: 0.088
>>> B-correction resolution: 6.0 - 1.4
>>> bulk solvent: density level= 0.359928 e/A^3, B-factor= 43.085 A^2

======== summary ========

resolution range: 500.0 - 1.4 A
R-values:
initial                                         r= 0.2348 free_r= 0.2361
after B-factor and/or bulk solvent correction   r= 0.1931 free_r= 0.2105

Monitor for target "mlf" is R-value:
working set= 0.1931 test set= 0.2105

luzzati coordinate error (5.0 - 1.4 A): 0.16 A
cross-validated luzzati coordinate error (5.0 - 1.4 A): 0.18 A
sigmaa coordinate error (5.0 - 1.4 A): 0.08 A
cross-validated sigmaa coordinate error (5.0 - 1.4 A): 0.10 A

rmsd bonds= 0.004763 with 2 bond violations > 0.05
rmsd angles= 1.32400 with 32 angle violations > 8.0
rmsd dihedrals= 24.22585 with 4 angle violations > 60.
rmsd improper= 0.80790 with 38 angle violations > 3.0

======== B-factors ========

average B-factor= 15.7963
minimum B-factor= 4.5681
maximum B-factor= 57.2481
B rmsd for bonded mainchain atoms= 0.660
B rmsd for bonded sidechain atoms= 1.229
B rmsd for angle mainchain atoms= 1.131
B rmsd for angle sidechain atoms= 1.777
current rweight= 0.22677665

======== diffraction data ========

reflections with |Fobs|/sigma_F < 0.0 rejected
reflections with |Fobs| > 10000 * rms(Fobs) rejected
theoretical total number of refl. in resol. range:       206243 (100.0 %)
number of unobserved reflections (no entry or |F|=0):     59797 ( 29.0 %)
number of reflections rejected:                               0 (  0.0 %)
total number of reflections used:                        146446 ( 71.0 %)
number of reflections in working set:                    139067 ( 67.4 %)
number of reflections in test set:                         7379 (  3.6 %)



=======> completeness

Test set (test = 1):

#bin | resolution range | #refl |
   1    3.02   500.01      1113    0.0521
   2    2.39     3.02      1042    0.0501
   3    2.09     2.39      1048    0.0508
   4    1.90     2.09      1000    0.0485
   5    1.76     1.90       989    0.0482
   6    1.66     1.76       822    0.0401
   7    1.58     1.66       584    0.0285
   8    1.51     1.58       419    0.0205
   9    1.45     1.51       265    0.0130
  10    1.40     1.45        97    0.0048

Working set:

#bin | resolution range | #refl |
   1    3.02   500.01     20201    0.9460
   2    2.39     3.02     19732    0.9491
   3    2.09     2.39     19506    0.9447
   4    1.90     2.09     19199    0.9316
   5    1.76     1.90     18521    0.9019
   6    1.66     1.76     15253    0.7434
   7    1.58     1.66     11380    0.5551
   8    1.51     1.58      8211    0.4018
   9    1.45     1.51      5065    0.2477
  10    1.40     1.45      1999    0.0980



=======> R-values with |Fobs|/sigma cutoff= 0.0

Test set (test = 1):

#bin | resolution range | #refl |
   1    3.02   500.01      1113    0.1999
   2    2.39     3.02      1042    0.2110
   3    2.09     2.39      1048    0.2083
   4    1.90     2.09      1000    0.2137
   5    1.76     1.90       989    0.2244
   6    1.66     1.76       822    0.2302
   7    1.58     1.66       584    0.2275
   8    1.51     1.58       419    0.2363
   9    1.45     1.51       265    0.2186
  10    1.40     1.45        97    0.2706

Working set:

#bin | resolution range | #refl |
   1    3.02   500.01     20201    0.1858
   2    2.39     3.02     19732    0.1989
   3    2.09     2.39     19506    0.1935
   4    1.90     2.09     19199    0.1922
   5    1.76     1.90     18521    0.1934
   6    1.66     1.76     15253    0.1973
   7    1.58     1.66     11380    0.1989
   8    1.51     1.58      8211    0.2067
   9    1.45     1.51      5065    0.2248
  10    1.40     1.45      1999    0.2589



=======> sigmaa-values

sigmaa calculated using cross-validated data (test set)
number of bins for sigmaa calculation= 50

#bin | resolution range | #refl |
   1    5.16   500.01      4426    0.9319
   2    4.09     5.16      4270    0.9576
   3    3.58     4.09      4227    0.9615
   4    3.25     3.58      4196    0.9632
   5    3.02     3.25      4195    0.9422
   6    2.84     3.02      4177    0.9512
   7    2.70     2.84      4163    0.9369
   8    2.58     2.70      4152    0.9424
   9    2.48     2.58      4156    0.9486
  10    2.39     2.48      4126    0.9448
  11    2.32     2.39      4131    0.9407
  12    2.25     2.32      4149    0.9498
  13    2.19     2.25      4101    0.9417
  14    2.14     2.19      4113    0.9351
  15    2.09     2.14      4060    0.9459
  16    2.05     2.09      4096    0.9333
  17    2.01     2.05      4078    0.9552
  18    1.97     2.01      4005    0.9445
  19    1.93     1.97      4001    0.9443
  20    1.90     1.93      4019    0.9352
  21    1.87     1.90      3971    0.9411
  22    1.84     1.87      3941    0.9319
  23    1.81     1.84      3946    0.9360
  24    1.79     1.81      3860    0.9480
  25    1.76     1.79      3792    0.9213
  26    1.74     1.76      3632    0.9184
  27    1.72     1.74      3458    0.9382
  28    1.70     1.72      3191    0.9414
  29    1.68     1.70      2994    0.9275
  30    1.66     1.68      2800    0.9268
  31    1.64     1.66      2678    0.9303
  32    1.62     1.64      2506    0.9388
  33    1.61     1.62      2366    0.9506
  34    1.59     1.61      2249    0.9402
  35    1.58     1.59      2165    0.9270
  36    1.56     1.58      2006    0.9365
  37    1.55     1.56      1874    0.9184
  38    1.53     1.55      1700    0.9072
  39    1.52     1.53      1595    0.9358
  40    1.51     1.52      1455    0.9398
  41    1.50     1.51      1336    0.9433
  42    1.48     1.50      1147    0.9541
  43    1.47     1.48      1077    0.9419
  44    1.46     1.47       962    0.9331
  45    1.45     1.46       808    0.9242
  46    1.44     1.45       696    0.9038


47 


1.43 


1.44 


570 


0.9371 


48 


1.42 


1.43 


408 


0.9387 


49 


1.41 


1.42 


288 


0.8914 


50 


1.40 


1.41 


134 


0.8194 



non-trans peptides

cis-peptide: segid=A resid=406 resname=THR
             current dihedral value= -0.010
cis-peptide: segid=A resid=532 resname=PRO
             current dihedral value= -0.435

no atoms have zero occupancy
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Table 6 

Crystallographic Refinement Statistics for the Drosophila Golgi Mannosidase II Associated with Swainsonine

>>> input coordinates: swainsonine3_ann_lbi.pdb
>>> molecular structure file: swainsoninegen3.mtf
>>> parameter file 1 : CNS_TOPPAR:protein_rep.param
>>> parameter file 2 : CNS_TOPPAR:water_rep.param
>>> parameter file 3 : CNS_TOPPAR:ion.param
>>> parameter file 4 : swa.par
>>> parameter file 5 : ../zntrmp/mpdnew2.par
>>> parameter file 6 : cis_peptide.param
>>> parameter file 7 : CNS_TOPPAR:carbohydrate.param
>>> reflection file= dgm2native_rejmerge.cv
>>> spacegroup: P2(1)2(1)2(1)
>>> cell dimensions: a= 68.902 b= 110.015 c= 138.472 alpha= 90 beta= 90 gamma= 90
>>> current wa= 0.632464 for target= mlf
>>> ncs= none
>>> initial B-factor correction applied to fobs :
>>> B11= 0.551 B22= -0.116 B33= -0.435
>>> B12= 0.000 B13= 0.000 B23= 0.000
>>> B-factor correction applied to coordinate array B: 0.413
>>> B-correction resolution: 6.0 - 1.87
>>> bulk solvent: density level= 0.354131 e/A^3, B-factor= 42.2797 A^2
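As an illustration of how the cell dimensions listed above can be used, the following minimal Python sketch computes the unit cell volume from the six cell parameters. The function name `cell_volume` is hypothetical and is not part of the refinement program; the formula is the general triclinic expression, which reduces to a x b x c for the orthorhombic cell reported here.

```python
import math


def cell_volume(a, b, c, alpha=90.0, beta=90.0, gamma=90.0):
    """Return the unit cell volume in cubic angstroms.

    Lengths are in angstroms and angles in degrees. This is the general
    triclinic formula; with all angles at 90 degrees it reduces to a*b*c.
    """
    ca = math.cos(math.radians(alpha))
    cb = math.cos(math.radians(beta))
    cg = math.cos(math.radians(gamma))
    return a * b * c * math.sqrt(1.0 - ca * ca - cb * cb - cg * cg + 2.0 * ca * cb * cg)


# Cell dimensions taken from the refinement header above.
volume = cell_volume(68.902, 110.015, 138.472)
print(f"unit cell volume: {volume:.1f} cubic angstroms")
```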



summary

resolution range: 500.0 - 1.87 A
R-values:
initial                                         r= 0.2078 free_r= 0.2371
after B-factor and/or bulk solvent correction   r= 0.1810 free_r= 0.2090

Monitor for target "mlf" is R-value:
working set= 0.1810 test set= 0.2090

luzzati coordinate error (5.0 - 1.87 A)                  0.19 A
cross-validated luzzati coordinate error (5.0 - 1.87 A)  0.22 A
sigmaa coordinate error (5.0 - 1.87 A)                   0.12 A
cross-validated sigmaa coordinate error (5.0 - 1.87 A)   0.14 A

rmsd bonds= 0.005230 with 2 bond violations > 0.05
rmsd angles= 1.31525 with 34 angle violations > 8.0
rmsd dihedrals= 24.18605 with 4 angle violations > 60.0
rmsd improper= 0.78741 with 34 angle violations > 3.0

average B-factor= 19.441
minimum B-factor= 8.18268
maximum B-factor= 64.5527

B rmsd for bonded mainchain atoms= 0.699
B rmsd for bonded sidechain atoms= 1.141
B rmsd for angle mainchain atoms= 1.167
B rmsd for angle sidechain atoms= 1.747
current rweight= 0.18691688



diffraction data

reflections with |Fobs|/sigma_F < 0.0 rejected
reflections with |Fobs| > 10000 * rms(Fobs) rejected
theoretical total number of refl. in resol. range:       87643 ( 100.0 % )
number of unobserved reflections (no entry or |F|=0):     2814 (   3.2 % )
number of reflections rejected:                              0 (   0.0 % )
total number of reflections used:                        84829 (  96.8 % )
number of reflections in working set:                    80543 (  91.9 % )
number of reflections in test set:                        4286 (   4.9 % )

completeness
Test set (test = 1):

 #bin | resolution range | #refl |
    1   4.03  500.01        452      0.0493
    2   3.20    4.03        488      0.0550
    3   2.79    3.20        398      0.0453
    4   2.54    2.79        433      0.0496
    5   2.36    2.54        460      0.0527
    6   2.22    2.36        433      0.0497
    7   2.11    2.22        417      0.0480
    8   2.01    2.11        402      0.0464
    9   1.94    2.01        406      0.0470
   10   1.87    1.94        397      0.0457



Working set:

 #bin | resolution range | #refl |
    1   4.03  500.01       8521      0.9298
    2   3.20    4.03       8330      0.9385
    3   2.79    3.20       8244      0.9391
    4   2.54    2.79       8134      0.9315
    5   2.36    2.54       8056      0.9234
    6   2.22    2.36       8022      0.9215
    7   2.11    2.22       7985      0.9196
    8   2.01    2.11       7940      0.9158
    9   1.94    2.01       7835      0.9077
   10   1.87    1.94       7476      0.8615
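The reflection counts in the completeness tables can be cross-checked against the diffraction data totals reported for this data set. The short Python sketch below is only an illustrative consistency check: the per-bin counts and totals are copied from the tables above, while the variable names and the `percent` helper are hypothetical.

```python
# Per-bin reflection counts copied from the completeness tables above.
working_bins = [8521, 8330, 8244, 8134, 8056, 8022, 7985, 7940, 7835, 7476]
test_bins = [452, 488, 398, 433, 460, 433, 417, 402, 406, 397]

# Totals copied from the "diffraction data" block.
THEORETICAL = 87643  # theoretical number of reflections in the resolution range
UNOBSERVED = 2814    # reflections with no entry or |F| = 0


def percent(count, total):
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100.0 * count / total, 1)


working_total = sum(working_bins)
test_total = sum(test_bins)
used_total = THEORETICAL - UNOBSERVED

print("working set:", working_total, percent(working_total, THEORETICAL), "%")
print("test set:   ", test_total, percent(test_total, THEORETICAL), "%")
print("used:       ", used_total, percent(used_total, THEORETICAL), "%")
```

The bin sums reproduce the working-set and test-set totals, and the percentages agree with those listed in the diffraction data block.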










R-values

=> R-values with |Fobs|/sigma cutoff= 0.0
Test set (test = 1):

 #bin | resolution range | #refl |
    1   4.03  500.01        452      0.2028
    2   3.20    4.03        488      0.1836
    3   2.79    3.20        398      0.2059
    4   2.54    2.79        433      0.2246
    5   2.36    2.54        460      0.2203
    6   2.22    2.36        433      0.2125
    7   2.11    2.22        417      0.2304
    8   2.01    2.11        402      0.2066
    9   1.94    2.01        406      0.2154
   10   1.87    1.94        397      0.2693



Working set:

 #bin | resolution range | #refl |
    1   4.03  500.01       8521      0.1673
    2   3.20    4.03       8330      0.1704
    3   2.79    3.20       8244      0.1861
    4   2.54    2.79       8134      0.1857
    5   2.36    2.54       8056      0.1830
    6   2.22    2.36       8022      0.1827
    7   2.11    2.22       7985      0.1894
    8   2.01    2.11       7940      0.1826
    9   1.94    2.01       7835      0.1920
   10   1.87    1.94       7476      0.2320
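For readers unfamiliar with the quantity tabulated above, the conventional crystallographic R-value is the sum of the absolute differences between observed and calculated structure factor amplitudes, divided by the sum of the observed amplitudes. The Python sketch below illustrates that definition and the separation into working set and test set (free R) by a test flag. It is a simplified illustration, not the refinement program's implementation: the function names, the single `scale` parameter and the toy amplitudes are assumptions, and bulk solvent and B-factor corrections are ignored.

```python
def r_factor(f_obs, f_calc, scale=1.0):
    """Conventional R-value: sum(| |Fobs| - scale*|Fcalc| |) / sum(|Fobs|)."""
    numerator = sum(abs(abs(o) - scale * abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


def split_r(reflections):
    """Return (r_work, r_free) from (f_obs, f_calc, test_flag) tuples.

    Reflections flagged test == 1 form the cross-validation (test) set;
    all other reflections form the working set.
    """
    work = [(o, c) for o, c, t in reflections if t != 1]
    free = [(o, c) for o, c, t in reflections if t == 1]
    r_work = r_factor([o for o, _ in work], [c for _, c in work])
    r_free = r_factor([o for o, _ in free], [c for _, c in free])
    return r_work, r_free


# Toy amplitudes (hypothetical values, for illustration only).
toy = [(100.0, 90.0, 0), (200.0, 220.0, 0), (50.0, 40.0, 1)]
r_work, r_free = split_r(toy)
print(f"r= {r_work:.4f} free_r= {r_free:.4f}")
```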



sigmaa-values

sigmaa calculated using cross-validated data (test set)
number of bins for sigmaa calculation= 50

 #bin | resolution range | #refl |
    1   6.89  500.01       1778      0.9682
    2   5.47    6.89       1834      0.9324
    3   4.78    5.47       1793      0.9330
    4   4.34    4.78       1797      0.9551
    5   4.03    4.34       1771      0.9634
    6   3.79    4.03       1778      0.9700
    7   3.60    3.79       1751      0.9563
    8   3.44    3.60       1749      0.9651
    9   3.31    3.44       1775      0.9515
   10   3.20    3.31       1765      0.9589
   11   3.10    3.20       1714      0.9433
   12   3.01    3.10       1746      0.9571
   13   2.93    3.01       1737      0.9545
   14   2.86    2.93       1721      0.9457
   15   2.79    2.86       1724      0.9391
   16   2.73    2.79       1727      0.9327
   17   2.68    2.73       1708      0.9429
   18   2.63    2.68       1726      0.9267
   19   2.58    2.63       1737      0.9416
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0 . 9407 
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0 . 9234 


20 


39 


2.03 
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0.9534 
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1634 


0.9255 
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1652 


0.9181 
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1.90 


1.91 


1616 


0.9194 
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49 


1.88 


1.90 


1597 


0.9026 




   50   1.87    1.88       1374      0.8987

non-trans peptides

cis-peptide: segid=A resid=406 resname=THR
             current dihedral value= 0.298
cis-peptide: segid=A resid=532 resname=PRO
             current dihedral value= -0.154
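The dihedral values reported for the two cis-peptides are close to 0 degrees, whereas a trans peptide bond has a dihedral near 180 degrees. Assuming the reported value is the peptide omega torsion (CA, C, N, CA of consecutive residues), the Python sketch below shows how such an angle can be computed from four atomic positions. The function names, the test coordinates and the 30 degree cutoff are illustrative assumptions and are not taken from the refinement program.

```python
import math


def _sub(p, q):
    return (p[0] - q[0], p[1] - q[1], p[2] - q[2])


def _dot(p, q):
    return p[0] * q[0] + p[1] * q[1] + p[2] * q[2]


def _cross(p, q):
    return (p[1] * q[2] - p[2] * q[1],
            p[2] * q[0] - p[0] * q[2],
            p[0] * q[1] - p[1] * q[0])


def dihedral(p0, p1, p2, p3):
    """Return the torsion angle p0-p1-p2-p3 in degrees, in the range (-180, 180]."""
    b1, b2, b3 = _sub(p1, p0), _sub(p2, p1), _sub(p3, p2)
    n1, n2 = _cross(b1, b2), _cross(b2, b3)
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))


def is_cis(omega, cutoff=30.0):
    """Classify a peptide bond as cis when |omega| is below an assumed cutoff."""
    return abs(omega) < cutoff


# Hypothetical planar arrangements: substituents on the same side (cis)
# and on opposite sides (trans) of the central bond.
cis_omega = dihedral((0, 1, 0), (0, 0, 0), (1, 0, 0), (1, 1, 0))
trans_omega = dihedral((0, 1, 0), (0, 0, 0), (1, 0, 0), (1, -1, 0))
print(cis_omega, is_cis(cis_omega))
print(trans_omega, is_cis(trans_omega))
```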






occupancies 



no atoms have zero occupancy 



Table 7

Glycoside Hydrolase Classification

CAZy Family: Glycoside Hydrolase Family 38
Known Activities: α-mannosidase (EC 3.2.1.24) (EC 3.2.1.114)
Mechanism: Retaining
Catalytic Nucleophile/Base: Asp
Catalytic Proton Donor: Not known
3D Structure Status: Not available
Relevant Links: InterPro; PFAM
Statistics: CAZy (48); GenBank/GenPept (108); Swissprot (31)




Arabidopsis thaliana



n.d 



&^^rio5ida& ^ Bos tatMs^ 



3.2.1.24



ORFP5501CL1 



Caenorhabditis elegans



^BpS230BABt1t2$.1 

* L313?3&A8S#3,1 Q2<*451 
\ U97fe8imc48763;i ' O0284S 
* liS7687AAb4876^1 019138 
U&7fi88MC48?B;3.1 

uS7£9#AAb4876ii 

> U87§j31 AAC48763.1 
lU&7€92AAC487€3.1 
* ; US7e93AAQ4S7^.f 
U97gS4AAb4876ai 
; };if^§i$^5m454: ^ Qaf|74^ 
" "%^^ANmm}i Q20&29 



<c^na^npsidase 
^:W||ndsyasr^ 



Canavalia ensiformis
Dictyostelium discoideum



α-mannosidase (α-Man-IIb) Drosophila melanogaster



ORF BcDNA,GHG2419 

ORF CG5322
ORF CG9463
ORF CG9465
ORF CG9466
ORF CG9468



Drosophila melanogaster

Drosophila melanogaster
Drosophila melanogaster
Drosophila melanogaster
Drosophila melanogaster
Drosophila melanogaster



ad 

3.2.124 j ' 

3.2.1114 ' AEO03?1CAA£5522&1 ' * 

A8O18079BAA75817J 
3.2.1314 ^AS6d3652 AAR^37^4^ &24451 

; >; ^^^caw732:i ^ ^ 

ad AF145601 AAD38576.1 
AE003628 AAF52958.1 

ad / ,AE00^8 AAF529S7.1 
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n.d. AE003622 AAF52710.1 
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ORF YbgB 
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n.d D90713 BAA35398.1 
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AF010192MB97733.1 
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a-mannosidase (lysosomal Fete catus 
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T 7/ w - 
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* /--.r^ AF04^2AAC7856af ^ 

3.2.1.114 X61172 CAA43480.1 P27046
n.d. AF107G18 AAD20813.1

n.d. Z92772 CAB07105.1 P96937

n.d. AP000003 BAA29929.1 OS8i^5

3.2.1.24 M57547 AAA41565.1 P21139



α-mannosidase 2
Rattus norvegicus 3.2.1.114

Saccharomyces cerevisiae 3.2.1.24
Sus scrofa 3.2.1.24
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ATOM 120 CB LEU A 46 22.781 49.009 -12.317 1.00 0.00 C 

ATOM 121 CG LEU A 46 22.024 49.702 -13.459 1.00 0.00 C
ATOM 122 CD1 LEU A 46 22.010 51.211 -13.228 1.00 0.00 C
ATOM 123 CD2 LEU A 46 22.686 49.379 -14.791 1.00 0.00 C
ATOM 124 N GLU A 47 20.868 46.963 -10.890 1.00 0.00 N
ATOM 125 CA GLU A 47 19.683 46.112 -10.925 1.00 0.00 C
ATOM 126 C GLU A 47 18.826 46.412 -9.702 1.00 0.00 C
ATOM 127 O GLU A 47 17.609 46.566 -9.807 1.00 0.00 O
ATOM 128 CB GLU A 47 20.086 44.632 -10.946 1.00 0.00 C
ATOM 129 CG GLU A 47 18.929 43.665 -11.196 1.00 0.00 C
ATOM 130 CD GLU A 47 18.116 43.357 -9.952 1.00 0.00 C
ATOM 131 OE1 GLU A 47 17.007 42.796 -10.093 1.00 0.00 O
ATOM 132 OE2 GLU A 47 18.582 43.657 -8.832 1.00 0.00 O
ATOM 133 N LEU A 48 19.467 46.514 -8.542 1.00 0.00 N
ATOM 134 CA LEU A 48 18.757 46.813 -7.302 1.00 0.00 C
ATOM 135 C LEU A 48 18.022 48.144 -7.416 1.00 0.00 C
ATOM 136 O LEU A 48 16.861 48.258 -7.026 1.00 0.00 O
ATOM 137 CB LEU A 48 19.737 46.869 -6.125 1.00 0.00 C
ATOM 138 CG LEU A 48 19.124 47.144 -4.747 1.00 0.00 C
ATOM 139 CD1 LEU A 48 18.094 46.072 -4.423 1.00 0.00 C
ATOM 140 CD2 LEU A 48 20.216 47.174 -3.688 1.00 0.00 C
ATOM 141 N TYR A 49 18.698 49.149 -7.962 1.00 0.00 N
ATOM 142 CA TYR A 49 18.097 50.470 -8.120 1.00 0.00 C
ATOM 143 C TYR A 49 16.855 50.421 -9.002 1.00 0.00 C
ATOM 144 O TYR A 49 15.883 51.141 -8.771 1.00 0.00 O
ATOM 145 CB TYR A 49 19.126 51.449 -8.704 1.00 0.00 C
ATOM 146 CG TYR A 49 19.677 52.405 -7.670 1.00 0.00 C
ATOM 147 CD1 TYR A 49 20.153 51.934 -6.444 1.00 0.00 C
ATOM 148 CD2 TYR A 49 19.672 53.783 -7.892 1.00 0.00 C
ATOM 149 CE1 TYR A 49 20.602 52.812 -5.458 1.00 0.00 C
ATOM 150 CE2 TYR A 49 20.119 54.671 -6.913 1.00 0.00 C
ATOM 151 CZ TYR A 49 20.578 54.180 -5.700 1.00 0.00 C
ATOM 152 OH TYR A 49 20.977 55.057 -4.717 1.00 0.00 O
ATOM 153 N ASP A 50 16.888 49.553 -10.005 1.00 0.00 N
ATOM 154 CA ASP A 50 15.768 49.407 -10.925 1.00 0.00 C
ATOM 155 C ASP A 50 14.532 48.872 -10.192 1.00 0.00 C
ATOM 156 O ASP A 50 13.409 49.299 -10.465 1.00 0.00 O
ATOM 157 CB ASP A 50 16.158 48.452 -12.058 1.00 0.00 C
ATOM 158 CG ASP A 50 15.327 48.653 -13.311 1.00 0.00 C
ATOM 159 OD1 ASP A 50 15.409 47.790 -14.212 1.00 0.00 O
ATOM 160 OD2 ASP A 50 14.609 49.672 -13.403 1.00 0.00 O
ATOM 161 N ARG A 51 14.749 47.951 -9.256 1.00 0.00 N
ATOM 162 CA ARG A 51 13.660 47.333 -8.495 1.00 0.00 C
ATOM 163 C ARG A 51 13.163 48.089 -7.262 1.00 0.00 C
ATOM 164 O ARG A 51 11.986 48.002 -6.920 1.00 0.00 O
ATOM 165 CB ARG A 51 14.057 45.914 -8.062 1.00 0.00 C
ATOM 166 CG ARG A 51 14.171 44.923 -9.198 1.00 0.00 C
ATOM 167 CD ARG A 51 14.347 43.487 -8.698 1.00 0.00 C
ATOM 168 NE ARG A 51 15.636 43.257 -8.047 1.00 0.00 N
ATOM 169 CZ ARG A 51 15.881 43.459 -6.756 1.00 0.00 C
ATOM 170 NH1 ARG A 51 14.923 43.899 -5.951 1.00 0.00 N
ATOM 171 NH2 ARG A 51 17.092 43.219 -6.269 1.00 0.00 N
ATOM 172 N MET A 52 14.050 48.818 -6.592 1.00 0.00 N
ATOM 173 CA MET A 52 13.680 49.560 -5.385 1.00 0.00 C
ATOM 174 C MET A 52 12.640 50.646 -5.618 1.00 0.00 C
ATOM 175 O MET A 52 12.600 51.260 -6.684 1.00 0.00 O
ATOM 176 CB MET A 52 14.923 50.187 -4.754 1.00 0.00 C
ATOM 177 CG MET A 52 15.887 49.188 -4.155 1.00 0.00 C
ATOM 178 SD MET A 52 17.401 50.006 -3.610 1.00 0.00 S
ATOM 179 CE MET A 52 16.765 51.029 -2.290 1.00 0.00 C
ATOM 180 N SER A 53 11.811 50.891 -4.605 1.00 0.00 N
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61.574 9.632 1.00 0.00 0 

58.906 8.207 1.00 0.00 C
57.512 8.393 1.00 0.00 C
58.851 8.553 1.00 0.00 C
57.084 9.857 1.00 0.00 C
62.191 8.227 1.00 0.00 N
63.553 8.088 1.00 0.00 C
64.160 9.488 1.00 0.00 C
64.771 9.813 1.00 0.00 O
64.416 7.234 1.00 0.00 C
64.403 7.740 1.00 0.00 O
63.980 10.312 1.00 0.00 N
64.482 11.684 1.00 0.00 C
63.692 12.565 1.00 0.00 C
64.278 13.339 1.00 0.00 O
64.396 12.305 1.00 0.00 C
65.570 12.003 1.00 0.00 C
65.372 11.460 1.00 0.00 C
66.877 12.281 1.00 0.00 C
66.447 11.200 1.00 0.00 C
67.959 12.027 1.00 0.00 C
67.738 11.488 1.00 0.00 C
68.808 11.250 1.00 0.00 O
62.366 12.452 1.00 0.00 N
61.541 13.284 1.00 0.00 C
61.830 13.058 1.00 0.00 C
61.898 14.012 1.00 0.00 O
60.050 13.054 1.00 0.00 C
59.182 14.167 1.00 0.00 C
59.015 15.335 1.00 0.00 C
58.594 14.082 1.00 0.00 C
58.279 16.406 1.00 0.00 C
57.859 15.145 1.00 0.00 C
57.701 16.312 1.00 0.00 C
61.991 11.796 1.00 0.00 N
62.270 11.469 1.00 0.00 C
63.619 12.060 1.00 0.00 C
63.759 12.598 1.00 0.00 O
62.265 9.957 1.00 0.00 C
64.602 11.958 1.00 0.00 N
65.945 12.489 1.00 0.00 C
65.867 13.998 1.00 0.00 C
66.527 14.540 1.00 0.00 O
66.829 12.213 1.00 0.00 C
68.221 12.856 1.00 0.00 C
69.245 11.990 1.00 0.00 C
70.593 12.559 1.00 0.00 N
70.966 13.668 1.00 0.00 C
70.095 14.318 1.00 0.00 N
72.198 14.139 1.00 0.00 N
65.044 14.665 1.00 0.00 N
64.856 16.109 1.00 0.00 C
64.115 16.520 1.00 0.00 C
64.599 17.336 1.00 0.00 O
64.065 16.587 1.00 0.00 C
63.817 18.063 1.00 0.00 C
64.844 18.948 1.00 0.00 C
62.555 18.572 1.00 0.00 C
64.619 20.319 1.00 0.00 C
62.319 19.947 1.00 0.00 C
63.356 20.819 1.00 0.00 C
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ATOM 974 C LYS A 145 56.698 58.793 22.912 1.00 0.00 C 

ATOM 975 O LYS A 145 55.735 58.046 22.752 1.00 0.00 O
ATOM 976 CB LYS A 145 56.168 60.958 24.078 1.00 0.00 C
ATOM 977 CG LYS A 145 55.887 61.635 25.417 1.00 0.00 C
ATOM 978 CD LYS A 145 54.925 60.800 26.252 1.00 0.00 C
ATOM 979 CE LYS A 145 54.648 61.430 27.613 1.00 0.00 C
ATOM 980 NZ LYS A 145 55.860 61.462 28.478 1.00 0.00 N
ATOM 981 N LYS A 146 57.640 58.966 21.986 1.00 0.00 N
ATOM 982 CA LYS A 146 57.589 58.225 20.728 1.00 0.00 C
ATOM 983 C LYS A 146 57.650 56.734 21.036 1.00 0.00 C
ATOM 984 O LYS A 146 56.967 55.935 20.401 1.00 0.00 O
ATOM 985 CB LYS A 146 58.761 58.602 19.810 1.00 0.00 C
ATOM 986 CG LYS A 146 58.624 59.956 19.127 1.00 0.00 C
ATOM 987 CD LYS A 146 59.760 60.201 18.141 1.00 0.00 C
ATOM 988 CE LYS A 146 59.594 61.542 17.439 1.00 0.00 C
ATOM 989 NZ LYS A 146 60.713 61.808 16.502 1.00 0.00 N
ATOM 990 N LEU A 147 58.470 56.369 22.019 1.00 0.00 N
ATOM 991 CA LEU A 147 58.618 54.976 22.423 1.00 0.00 C
ATOM 992 C LEU A 147 57.332 54.452 23.064 1.00 0.00 C
ATOM 993 O LEU A 147 56.913 53.328 22.792 1.00 0.00 O
ATOM 994 CB LEU A 147 59.795 54.828 23.394 1.00 0.00 C
ATOM 995 CG LEU A 147 61.174 55.065 22.765 1.00 0.00 C
ATOM 996 CD1 LEU A 147 62.267 54.955 23.819 1.00 0.00 C
ATOM 997 CD2 LEU A 147 61.404 54.043 21.659 1.00 0.00 C
ATOM 998 N GLN A 148 56.707 55.258 23.917 1.00 0.00 N
ATOM 999 CA GLN A 148 55.456 54.838 24.541 1.00 0.00 C
ATOM 1000 C GLN A 148 54.393 54.655 23.461 1.00 0.00 C
ATOM 1001 O GLN A 148 53.578 53.734 23.525 1.00 0.00 O
ATOM 1002 CB GLN A 148 54.961 55.874 25.552 1.00 0.00 C
ATOM 1003 CG GLN A 148 55.689 55.857 26.884 1.00 0.00 C
ATOM 1004 CD GLN A 148 55.020 56.743 27.922 1.00 0.00 C
ATOM 1005 OE1 GLN A 148 55.426 56.768 29.086 1.00 0.00 O
ATOM 1006 NE2 GLN A 148 53.990 57.477 27.505 1.00 0.00 N
ATOM 1007 N MET A 149 54.407 55.537 22.468 1.00 0.00 N
ATOM 1008 CA MET A 149 53.434 55.467 21.385 1.00 0.00 C
ATOM 1009 C MET A 149 53.634 54.196 20.565 1.00 0.00 C
ATOM 1010 O MET A 149 52.673 53.508 20.232 1.00 0.00 O
ATOM 1011 CB MET A 149 53.554 56.697 20.481 1.00 0.00 C
ATOM 1012 CG MET A 149 52.533 56.747 19.352 1.00 0.00 C
ATOM 1013 SD MET A 149 50.823 56.818 19.930 1.00 0.00 S
ATOM 1014 CE MET A 149 50.571 58.585 20.073 1.00 0.00 C
ATOM 1015 N LYS A 150 54.885 53.879 20.244 1.00 0.00 N
ATOM 1016 CA LYS A 150 55.161 52.682 19.462 1.00 0.00 C
ATOM 1017 C LYS A 150 54.741 51.429 20.215 1.00 0.00 C
ATOM 1018 O LYS A 150 54.314 50.450 19.606 1.00 0.00 O
ATOM 1019 CB LYS A 150 56.650 52.601 19.104 1.00 0.00 C
ATOM 1020 CG LYS A 150 57.115 53.724 18.190 1.00 0.00 C
ATOM 1021 CD LYS A 150 58.623 53.714 17.990 1.00 0.00 C
ATOM 1022 CE LYS A 150 59.076 52.493 17.214 1.00 0.00 C
ATOM 1023 NZ LYS A 150 60.543 52.537 16.948 1.00 0.00 N
ATOM 1024 N SER A 151 54.839 51.464 21.542 1.00 0.00 N
ATOM 1025 CA SER A 151 54.474 50.305 22.346 1.00 0.00 C
ATOM 1026 C SER A 151 52.969 50.036 22.373 1.00 0.00 C
ATOM 1027 O SER A 151 52.548 48.883 22.271 1.00 0.00 O
ATOM 1028 CB SER A 151 55.002 50.454 23.778 1.00 0.00 C
ATOM 1029 OG SER A 151 54.275 51.437 24.486 1.00 0.00 O
ATOM 1030 N ILE A 152 52.150 51.079 22.505 1.00 0.00 N
ATOM 1031 CA ILE A 152 50.710 50.854 22.528 1.00 0.00 C
ATOM 1032 C ILE A 152 50.178 50.469 21.147 1.00 0.00 C
ATOM 1033 O ILE A 152 49.072 49.948 21.026 1.00 0.00 O
ATOM 1034 CB ILE A 152 49.924 52.074 23.075 1.00 0.00 C
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ATOM 2560 CD1 LEU A 338 30.606 54.919 22.707 1.00 0.00 C 

ATOM 2561 CD2 LEU A 338 32.827 54.315 23.693 1.00 0.00 C
ATOM 2562 N GLY A 339 32.211 56.849 18.983 1.00 0.00 N
ATOM 2563 CA GLY A 339 32.924 58.091 18.731 1.00 0.00 C
ATOM 2564 C GLY A 339 32.131 59.114 17.943 1.00 0.00 C
ATOM 2565 O GLY A 339 30.970 58.889 17.596 1.00 0.00 O
ATOM 2566 N ASP A 340 32.769 60.244 17.658 1.00 0.00 N
ATOM 2567 CA ASP A 340 32.141 61.329 16.919 1.00 0.00 C
ATOM 2568 C ASP A 340 33.265 62.270 16.472 1.00 0.00 C
ATOM 2569 O ASP A 340 34.445 61.949 16.626 1.00 0.00 O
ATOM 2570 CB ASP A 340 31.165 62.076 17.831 1.00 0.00 C
ATOM 2571 CG ASP A 340 30.043 62.759 17.068 1.00 0.00 C
ATOM 2572 OD1 ASP A 340 30.242 63.118 15.886 1.00 0.00 O
ATOM 2573 OD2 ASP A 340 28.961 62.952 17.664 1.00 0.00 O
ATOM 2574 N ASP A 341 32.902 63.432 15.938 1.00 0.00 N
ATOM 2575 CA ASP A 341 33.896 64.396 15.468 1.00 0.00 C
ATOM 2576 C ASP A 341 34.810 64.907 16.574 1.00 0.00 C
ATOM 2577 O ASP A 341 34.343 65.379 17.612 1.00 0.00 O
ATOM 2578 CB ASP A 341 33.211 65.593 14.803 1.00 0.00 C
ATOM 2579 CG ASP A 341 32.537 65.230 13.494 1.00 0.00 C
ATOM 2580 OD1 ASP A 341 32.491 64.027 13.165 1.00 0.00 O
ATOM 2581 OD2 ASP A 341 32.053 66.150 12.799 1.00 0.00 O
ATOM 2582 N PHE A 342 36.113 64.813 16.330 1.00 0.00 N
ATOM 2583 CA PHE A 342 37.128 65.274 17.264 1.00 0.00 C
ATOM 2584 C PHE A 342 36.835 64.915 18.717 1.00 0.00 C
ATOM 2585 O PHE A 342 36.977 65.748 19.619 1.00 0.00 O
ATOM 2586 CB PHE A 342 37.314 66.788 17.125 1.00 0.00 C
ATOM 2587 CG PHE A 342 37.850 67.213 15.781 1.00 0.00 C
ATOM 2588 CD1 PHE A 342 36.986 67.554 14.743 1.00 0.00 C
ATOM 2589 CD2 PHE A 342 39.223 67.254 15.549 1.00 0.00 C
ATOM 2590 CE1 PHE A 342 37.478 67.932 13.492 1.00 0.00 C
ATOM 2591 CE2 PHE A 342 39.728 67.628 14.303 1.00 0.00 C
ATOM 2592 CZ PHE A 342 38.853 67.969 13.272 1.00 0.00 C
ATOM 2593 N ARG A 343 36.432 63.667 18.936 1.00 0.00 N
ATOM 2594 CA ARG A 343 36.138 63.187 20.278 1.00 0.00 C
ATOM 2595 C ARG A 343 37.402 62.725 20.993 1.00 0.00 C
ATOM 2596 O ARG A 343 38.473 62.616 20.389 1.00 0.00 O
ATOM 2597 CB ARG A 343 35.131 62.033 20.226 1.00 0.00 C
ATOM 2598 CG ARG A 343 33.686 62.471 20.031 1.00 0.00 C
ATOM 2599 CD ARG A 343 33.263 63.432 21.143 1.00 0.00 C
ATOM 2600 NE ARG A 343 31.836 63.740 21.103 1.00 0.00 N
ATOM 2601 CZ ARG A 343 30.892 63.005 21.682 1.00 0.00 C
ATOM 2602 NH1 ARG A 343 31.213 61.907 22.356 1.00 0.00 N
ATOM 2603 NH2 ARG A 343 29.621 63.374 21.591 1.00 0.00 N
ATOM 2604 N PHE A 344 37.256 62.451 22.285 1.00 0.00 N
ATOM 2605 CA PHE A 344 38.351 61.999 23.136 1.00 0.00 C
ATOM 2606 C PHE A 344 39.486 63.003 23.223 1.00 0.00 C
ATOM 2607 O PHE A 344 40.661 62.653 23.089 1.00 0.00 O
ATOM 2608 CB PHE A 344 38.864 60.640 22.660 1.00 0.00 C
ATOM 2609 CG PHE A 344 37.858 59.544 22.816 1.00 0.00 C
ATOM 2610 CD1 PHE A 344 37.086 59.128 21.737 1.00 0.00 C
ATOM 2611 CD2 PHE A 344 37.633 58.968 24.064 1.00 0.00 C
ATOM 2612 CE1 PHE A 344 36.103 58.158 21.896 1.00 0.00 C
ATOM 2613 CE2 PHE A 344 36.650 57.996 24.234 1.00 0.00 C
ATOM 2614 CZ PHE A 344 35.883 57.591 23.147 1.00 0.00 C
ATOM 2615 N LYS A 345 39.111 64.253 23.475 1.00 0.00 N
ATOM 2616 CA LYS A 345 40.055 65.352 23.591 1.00 0.00 C
ATOM 2617 C LYS A 345 40.501 65.568 25.038 1.00 0.00 C
ATOM 2618 O LYS A 345 41.691 65.581 25.328 1.00 0.00 O
ATOM 2619 CB LYS A 345 39.411 66.631 23.056 1.00 0.00 C
ATOM 2620 CG LYS A 345 40.278 67.865 23.145 1.00 0.00 C
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451 


-8 


645 


1 


00 


0 


00 


0 


ATOM 


8262 


OH2 


WAT 


w 


80 


41 


025 


52 


500 


-17, 


001 


1 


00 


0 


00 


0 


ATOM 


8263 


OH2 


WAT 


w 


81 


31 


432 


76 


939 


-6 


261 


1 


00 


0 


00 


0 


ATOM 


8264 


OH2 


WAT 


w 


82 


43 


372 


68 


424 


-24 


901 


1 


00 


0 


00 


0 


ATOM 


8265 


OH2 


WAT 


w 


83 


32 


759 


81 


310 


-11 


722 


1 


00 


0 


00 


0 


ATOM 


8266 


OH2 


WAT 


w 


84 


20 


528 


66 


418 


-19 


640 


1 


00 


0 


00 


0 


ATOM 


8267 


OH2 


WAT 


w 


85 


40 


399 


62 


932 


6 


179 


1 


00 


0 


00 


0 


ATOM 


8268 


OH2 


WAT 


w 


86 


42 


121 


47 


451 


-24 


562 


1 


00 


0 


00 


0 


ATOM 


8269 


OH2 


WAT 


w 


87 


44 


531 


42 


000 


-0 


591 


1 


00 


0. 


00 


0 


ATOM 


8270 


OH2 


WAT 


w 


88 


38 


621 


55 


297 


-31, 


696 


1. 


00 


0 


00 


0 


ATOM 


8271 


OH2 


WAT 


w 


89 


22 


298 


54 


497 


-11 


948 


1. 


00 


0. 


00 


0 


ATOM 


8272 


OH2 


WAT 


w 


90 


48 


309 


89 


4 67 


-30 


626 


1. 


00 


0. 


00 


0 


ATOM 


8273 


OH2 


WAT 


w 


91 


36 


699 


51 


822 


3 


762 


1 


00 


0. 


00 


0 


ATOM 


8274 


OH2 


WAT 


w 


92 


38 


737 


62, 


445 


8 


410 


1 


00 


0. 


00 


0 


ATOM 


8275 


OH 2 


WAT 


w 


93 


47, 


441 


46 


140 


3. 


627 


1. 


00 


0 


00 


0 


ATOM 


8276 


OH 2 


WAT 


w 


94 


33 


595 


57, 


288 


-35. 


878 


1. 


00 


0 


00 


0 


ATOM 


8277 


OH2 


WAT 


w 


95 


32 


313 


77 


764 


-29. 


564 


1. 


00 


0. 


00 


0 


ATOM 


8278 


OH2 


WAT 


w 


96 


39 


241 


39 


239 


17, 


199 


1. 


00 


0 


00 


0 


ATOM 


8279 


OH2 


WAT 


w 


97 


35 


924 


53 


322 


1. 


328 


1 


00 


0. 


00 


0 


ATOM 


8280 


OH2 


WAT 


w 


98 


14. 


453 


60. 


152 


5. 


057 


1. 


00 


0 


00 


0 


ATOM 


8281 


OH2 


WAT 


w 


99 


21 


658 


57, 


289 


5. 


019 


1. 


00 


0 


00 


0 


ATOM 


8282 


OH2 


WAT 


w 


100 


43 


041 


48 


243 


11. 


159 


1. 


00 


0. 


00 


0 


ATOM 


8283 


OH2 


WAT 


w 


101 


26, 


081 


73. 


002 


5, 


562 


1. 


00 


0. 


00 


0 


ATOM 


8284 


OH2 


WAT 


w 


102 


27 


438 


48. 


664 


-21, 


564 


1. 


00 


0. 


00 


0 


ATOM 


8285 


OH2 


WAT 


w 


103 


33. 


310 


78. 


867 


-4. 


982 


1. 


00 


0. 


00 


0 


ATOM 


8286 


OH2 


WAT 


w 


104 


49. 


995 


59, 


656 


-11, 


589 


1, 


00 


0. 


00 


0 


ATOM 


8287 


OH2 


WAT 


w 


105 


25. 


585 


51. 


377 


-9. 


520 


1. 


00 


0. 


00 


0 


ATOM 


8288 


OH2 


WAT 


w 


106 


40. 


204 


56, 


517 


5, 


844 


1. 


00 


0. 


00 


0 


ATOM 


8289 


OH2 


WAT 


w 


107 


14, 


014 


59. 


888 


-3. 


434 


1. 


00 


0. 


00 


0 


ATOM 


8290 


OH2 


WAT 


w 


108 


19. 


703 


53, 


642 


-11. 


754 


1. 


00 


0. 


00 


0 


ATOM 


8291 


OH2 


WAT 


w 


109 


26. 


515 


63. 


824 


17. 


139 


1. 


00 


0. 


00 


0 


ATOM 


8292 


OH2 


WAT 


w 


110 


19. 


778 


57. 


566 


-11. 


797 


1 . 


00 


0. 


00 


0 


ATOM 


8293 


OH2 


WAT 


w 


111 


47 . 


187 


48. 


531 


11. 


419 


1. 


00 


0. 


00 


0 
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ATOM 


8294 


OH2 


WAT 


W 


112 


67. 


806 


81, 


.793 


-25. 


640 


1. 


00 


0. 


,00 


ATOM 


8295 


OH2 


WAT 


w 


113 


22, 


910 


51. 


.819 


-8. 


.744 


1. 


00 


0. 


.00 


ATOM 


8296 


OH2 


WAT 


w 


114 


46. 


600 


70, 


.560 


-36. 


706 


1. 


00 


0. 


.00 


ATOM 


8297 


OH2 


WAT 


w 


115 


20. 


546 


57, 


.585 


-5. 


285 


1. 


00 


0. 


.00 


ATOM 


8298 


OH2 


WAT 


w 


116 


18. 


164 


60, 


.620 


24. 


,435 


1. 


00 


0. 


.00 


ATOM 


8299 


OH2 


WAT 


w 


117 


41. 


283 


68. 


.622 


-32. 


525 


1. 


00 


0. 


.00 


ATOM 


8300 


OH2 


WAT 


w 


118 


38. 


310 


40, 


.108 


1. 


670 


1. 


00 


0. 


.00 


ATOM 


8301 


OH2 


WAT 


w 


119 


23. 


864 


58, 


.245 


6. 


,322 


1. 


00 


0. 


.00 


ATOM 


8302 


OH2 


WAT 


w 


120 


18. 


,116 


59. 


.931 


-20. 


,090 


1. 


00 


0. 


.00 


ATOM 


8303 


OH2 


WAT 


w 


121 


41. 


272 


77, 


.741 


-14. 


,097 


1. 


00 


0. 


.00 


ATOM 


8304 


OH2 


WAT 


w 


122 


52. 


834 


59. 


.194 


-1. 


,694 


1. 


00 


0. 


.00 


ATOM 


8305 


OH2 


WAT 


w 


123 


47. 


929 


49, 


.601 


13. 


774 


1. 


00 


0. 


,00 


ATOM 


8306 


OH2 


WAT 


w 


124 


35. 


223 


43, 


.723 


28. 


642 


1. 


00 


0, 


.00 


ATOM 


8307 


OH2 


WAT 


w 


125 


59. 


493 


60, 


.742 


-6. 


,299 


1. 


00 


0. 


.00 


ATOM 


8308 


OH2 


WAT 


w 


126 


53. 


675 


72, 


.733 


-19. 


,141 


1. 


00 


0. 


,00 


ATOM 


8309 


OH2 


WAT 


w 


127 


39. 


803 


54. 


.101 


-20, 


,844 


1. 


00 


0. 


.00 


ATOM 


8310 


OH2 


WAT 


w 


128 


33. 


123 


36. 


.777 


-4. 


,396 


1. 


00 


0. 


.00 


ATOM 


8311 


OH2 


WAT 


w 


129 


14. 


314 


64. 


.081 


-16. 


354 


1. 


00 


0. 


.00 


ATOM 


8312 


OH2 


WAT 


w 


130 


47. 


524 


43. 


.079 


-2. 


432 


1. 


00 


0. 


.00 


ATOM 


8313 


OH2 


WAT 


w 


131 


46. 


908 


55. 


.986 


-25. 


481 


1. 


00 


0. 


.00 


ATOM 


8314 


OH2 


WAT 


w 


132 


32. 


142 


58. 


.803 


-28. 


456 


1. 


00 


0. 


.00 


ATOM 


8315 


OH2 


WAT 


w 


133 


49. 


282 


50. 


.063 


28. 


217 


1. 


00 


0, 


.00 


ATOM 


8316 


OH2 


WAT 


w 


134 


26. 


302 


37, 


.406 


29. 


348 


1. 


00 


0. 


.00 


ATOM 


8317 


OH2 


WAT 


w 


135 


51. 


894 


46. 


.658 


1. 


206 


1. 


00 


0. 


,00 


ATOM 


8318 


OH2 


WAT 


w 


136 


41. 


699 


58. 


.872 


-16. 


019 


1. 


00 


0. 


.00 


ATOM 


8319 


OH2 


WAT 


w 


137 


13. 


825 


54. 


.259 


4. 


434 


1. 


00 


0. 


.00 


ATOM   8320  OH2 WAT W 138      67.796  79.172 -25.325  1.00  0.00
ATOM   8321  OH2 WAT W 139      42.167  75.423 -16.044  1.00  0.00
ATOM   8322  OH2 WAT W 140      23.124  56.190  30.804  1.00  0.00
ATOM   8323  OH2 WAT W 141      56.986  61.143   1.570  1.00  0.00
ATOM   8324  OH2 WAT W 142      20.070  74.557  -6.868  1.00  0.00
ATOM   8325  OH2 WAT W 143      13.368  53.676   1.802  1.00  0.00
ATOM   8326  OH2 WAT W 144      34.263  34.517  16.103  1.00  0.00
ATOM   8327  OH2 WAT W 145      33.945  64.719 -32.234  1.00  0.00
ATOM   8328  OH2 WAT W 146      14.059  49.309  15.412  1.00  0.00
ATOM   8329  OH2 WAT W 147      30.401  35.321 -12.383  1.00  0.00
ATOM   8330  OH2 WAT W 148      18.402  55.023  14.030  1.00  0.00
ATOM   8331  OH2 WAT W 149      15.633  60.472  15.485  1.00  0.00
ATOM   8332  OH2 WAT W 150      16.788  74.865  -3.021  1.00  0.00
ATOM   8333  OH2 WAT W 151      56.517  46.305 -17.438  1.00  0.00
ATOM   8334  OH2 WAT W 152      45.631  60.108 -18.238  1.00  0.00
ATOM   8335  OH2 WAT W 153      28.185  84.769 -35.431  1.00  0.00
ATOM   8336  OH2 WAT W 154      73.024  65.247  -7.728  1.00  0.00
ATOM   8337  OH2 WAT W 155      73.780  66.204 -17.660  1.00  0.00
ATOM   8338  OH2 WAT W 156      59.268  50.053  -0.663  1.00  0.00
ATOM   8339  OH2 WAT W 157      23.056  78.515 -24.846  1.00  0.00
ATOM   8340  OH2 WAT W 158      14.298  54.490 -16.078  1.00  0.00
ATOM   8341  OH2 WAT W 159      43.564  73.335   5.822  1.00  0.00
ATOM   8342  OH2 WAT W 160      11.895  55.196 -12.500  1.00  0.00
ATOM   8343  OH2 WAT W 161      63.546  44.759  -6.927  1.00  0.00
ATOM   8344  OH2 WAT W 162      24.751  51.137 -24.137  1.00  0.00
ATOM   8345  OH2 WAT W 163      35.483  45.434 -27.391  1.00  0.00
ATOM   8346  OH2 WAT W 164      28.707  49.870 -24.430  1.00  0.00
ATOM   8347  OH2 WAT W 165      54.814  81.194 -23.475  1.00  0.00
ATOM   8348  OH2 WAT W 166      23.601  63.813   7.819  1.00  0.00
ATOM   8349  OH2 WAT W 167      49.745  57.750  -7.642  1.00  0.00
ATOM   8350  OH2 WAT W 168      46.072  71.136   0.115  1.00  0.00
ATOM   8351  OH2 WAT W 169      24.320  55.824  -1.913  1.00  0.00
ATOM   8352  OH2 WAT W 170      28.642  41.362 -21.793  1.00  0.00
ATOM   8353  OH2 WAT W 171      37.052  45.715  27.462  1.00  0.00
ATOM   8354  OH2 WAT W 172      41.481  41.357 -17.624  1.00  0.00
ATOM   8355  OH2 WAT W 173      40.307  62.729 -31.958  1.00  0.00
ATOM   8356  OH2 WAT W 174      49.315  59.394 -28.703  1.00  0.00
ATOM   8357  OH2 WAT W 175      64.406  39.380 -10.443  1.00  0.00
ATOM   8358  OH2 WAT W 176      19.161  72.628   2.904  1.00  0.00
ATOM   8359  OH2 WAT W 177      17.057  70.449 -16.414  1.00  0.00
ATOM   8360  OH2 WAT W 178       9.539  52.641  16.836  1.00  0.00
ATOM   8361  OH2 WAT W 179      53.885  71.468 -36.741  1.00  0.00
ATOM   8362  OH2 WAT W 180      50.902  71.060 -23.481  1.00  0.00
ATOM   8363  OH2 WAT W 181      49.374  44.299   4.432  1.00  0.00
ATOM   8364  OH2 WAT W 182      59.359  68.451   2.377  1.00  0.00
ATOM   8365  OH2 WAT W 183      25.436  53.879  11.076  1.00  0.00
ATOM   8366  OH2 WAT W 184      69.087  61.577  -2.401  1.00  0.00
ATOM   8367  OH2 WAT W 185      58.340  77.598 -38.439  1.00  0.00
ATOM   8368  OH2 WAT W 186      44.378  46.203   9.940  1.00  0.00
ATOM   8369  OH2 WAT W 187      46.546  57.482 -18.766  1.00  0.00
ATOM   8370  OH2 WAT W 188      33.489  83.732 -10.371  1.00  0.00
ATOM   8371  OH2 WAT W 189      47.049  68.676 -30.437  1.00  0.00
ATOM   8372  OH2 WAT W 190      44.368  56.164 -15.428  1.00  0.00
ATOM   8373  OH2 WAT W 191      13.141  67.038  -4.766  1.00  0.00
ATOM   8374  OH2 WAT W 192      24.521  39.279 -11.477  1.00  0.00
ATOM   8375  OH2 WAT W 193      41.586  61.285 -29.726  1.00  0.00
ATOM   8376  OH2 WAT W 194      27.184  41.484  13.312  1.00  0.00
ATOM   8377  OH2 WAT W 195      41.986  88.026 -40.822  1.00  0.00
ATOM   8378  OH2 WAT W 196      69.847  70.116 -12.640  1.00  0.00
ATOM   8379  OH2 WAT W 197      64.131  78.076 -35.187  1.00  0.00
ATOM   8380  OH2 WAT W 198      20.419  65.482   7.650  1.00  0.00
ATOM   8381  OH2 WAT W 199      11.387  61.513 -18.591  1.00  0.00
ATOM   8382  OH2 WAT W 200      24.508  70.807  -8.820  1.00  0.00
ATOM   8383  OH2 WAT W 201      25.021  40.148  14.199  1.00  0.00
ATOM   8384  OH2 WAT W 202      22.730  63.440 -33.177  1.00  0.00
ATOM   8385  OH2 WAT W 203      41.675  43.444   7.942  1.00  0.00
ATOM   8386  OH2 WAT W 204      21.031  51.927  47.241  1.00  0.00
ATOM   8387  OH2 WAT W 205      41.315  50.431  33.202  1.00  0.00
ATOM   8388  OH2 WAT W 206      45.964  52.942 -19.262  1.00  0.00
ATOM   8389  OH2 WAT W 207      51.437  76.535 -40.973  1.00  0.00
ATOM   8390  OH2 WAT W 208      25.691  86.538 -31.936  1.00  0
c


ATOM   9195  C2  MPD M   1      16.246  61.411  10.589  1.00  0.00  C
ATOM   9196  O2  MPD M   1      16.899  60.292   9.952  1.00  0.00  O
ATOM   9197  CM  MPD M   1      16.897  62.682  10.105  1.00  0.00  C
ATOM   9198  C3  MPD M   1      16.386  61.237  12.121  1.00  0.00  C
ATOM   9199  C4  MPD M   1      17.772  60.909  12.678  1.00  0.00  C
ATOM   9200  O4  MPD M   1      17.676  59.909  13.666  1.00  0.00  O
ATOM   9201  C5  MPD M   1      18.376  62.135  13.353  1.00  0.00  C
ATOM   9202  ZN  ZN1 Z   1      34.563  64.336   8.063  1.00  0.00  Zn
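For readers who wish to process coordinate records of this kind computationally, the following is a minimal illustrative sketch, not part of the disclosed method. It assumes each record is a single whitespace-separated line with the fields record name, atom serial number, atom name, residue name, chain identifier, residue number, x, y, z, occupancy, B-factor and an optional element symbol. The names `AtomRecord` and `parse_atom_line` are hypothetical and chosen only for this example.

```python
from typing import NamedTuple, Optional


class AtomRecord(NamedTuple):
    """One coordinate record (hypothetical container for illustration)."""
    serial: int
    name: str
    res_name: str
    chain: str
    res_seq: int
    x: float
    y: float
    z: float
    occupancy: float
    b_factor: float
    element: Optional[str]


def parse_atom_line(line: str) -> AtomRecord:
    """Parse one whitespace-separated ATOM record.

    Assumes 11 fields, or 12 when a trailing element symbol is present.
    """
    fields = line.split()
    if len(fields) not in (11, 12) or fields[0] != "ATOM":
        raise ValueError(f"not a recognised ATOM record: {line!r}")
    element = fields[11] if len(fields) == 12 else None
    return AtomRecord(
        serial=int(fields[1]),
        name=fields[2],
        res_name=fields[3],
        chain=fields[4],
        res_seq=int(fields[5]),
        x=float(fields[6]),
        y=float(fields[7]),
        z=float(fields[8]),
        occupancy=float(fields[9]),
        b_factor=float(fields[10]),
        element=element,
    )


# Example: the zinc ion record and one water record from the listing.
zinc = parse_atom_line("ATOM 9202 ZN ZN1 Z 1 34.563 64.336 8.063 1.00 0.00 Zn")
water = parse_atom_line("ATOM 8320 OH2 WAT W 138 67.796 79.172 -25.325 1.00 0.00")
```

Splitting on whitespace is used here because the listing is not guaranteed to follow fixed column positions; a strict fixed-column PDB reader would be a different design choice.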
Table 9. Data Collection Statistics

                            MAD (Se-Met) of dGMII               Native dGMII     DMNJ        Swainsonine
                            Inflection  Peak        Remote      High resolution  complex     complex

Wavelength (Å)              0.9790      0.9786      0.9770      1.0              1.0         1.54189
Effective resolution (Å)    2.14        2.14        2.14        1.76             1.69        1.87
Maximum resolution (Å)      1.70        1.70        1.70        1.4              1.5         1.87
Highest resolution shell    2.31-2.14   2.31-2.14   2.31-2.14   1.90-1.76        1.75-1.69   1.91-1.87
Temperature (K)             100         100         100         100              100         100
# unique reflections
    overall                 59212       59092       59218       104565           114653      87386
    shell                   11288       11297       11296       19882            10722       5601
Completeness (%)
    overall                 99.7        99.8        99.8        97.0             97.8        99.7
    shell                   96.2        96.3        96.2        94.9             92.6        96.9
Rmerge*
    overall                 0.050       0.054       0.057       0.056            0.086       0.078
    shell                   0.086       0.093       0.105       0.127            0.186       0.452



*$R_{merge} = \sum_h \sum_i |I_{h,i} - \langle I_h \rangle| / \sum_h \sum_i I_{h,i}$, where $\langle I_h \rangle$ is the average of equivalent reflections and the sum is
extended over all observations, $i$, for all unique reflections, $h$.
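By way of illustration only, the Rmerge statistic defined in the footnote to Table 9 can be computed as sketched below. The sketch assumes the observations are supplied as (reflection index, intensity) pairs that have already been scaled and reduced to unique indices; the function name `r_merge` and the input layout are assumptions made for this example and are not taken from the data processing software actually used.

```python
from collections import defaultdict
from typing import Hashable, Iterable, Tuple


def r_merge(observations: Iterable[Tuple[Hashable, float]]) -> float:
    """Return sum_h sum_i |I_hi - <I_h>| / sum_h sum_i I_hi.

    `observations` yields (h, I) pairs, where h identifies a unique
    reflection and I is one measured intensity for that reflection.
    """
    # Group the individual measurements by unique reflection h.
    groups = defaultdict(list)
    for hkl, intensity in observations:
        groups[hkl].append(intensity)

    numerator = 0.0
    denominator = 0.0
    for intensities in groups.values():
        mean = sum(intensities) / len(intensities)  # <I_h>
        numerator += sum(abs(i - mean) for i in intensities)
        denominator += sum(intensities)

    if denominator == 0:
        raise ValueError("sum of intensities is zero; Rmerge is undefined")
    return numerator / denominator


# Toy example: two unique reflections, each measured twice.
example = [((1, 0, 0), 100.0), ((1, 0, 0), 110.0),
           ((0, 1, 0), 50.0), ((0, 1, 0), 50.0)]
value = r_merge(example)  # (5 + 5 + 0 + 0) / 310
```

Restricting the input to reflections within a given resolution range would give the corresponding per-shell value.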



Table 10. Refinement Statistics

                                          dGMII       dGMII-swainsonine   dGMII-DMNJ
                                                      complex             complex

Resolution (Å)                            500-1.40    500-1.87            500-1.5
Rcryst (%)                                19.30       18.10               19.69
Rfree (%)                                 21.05       20.90               21.56
Atoms (#)                                 9194        9202                9199
Residues (#)                              1014        1014                1014
Water molecules (#)                       981         985                 983
r.m.s.d. Bonds (Å)                        0.005       0.005               0.006
r.m.s.d. Angles (°)                       1.32        1.31                1.33
r.m.s.d. Improper dihedrals (°)           0.81        0.78                0.80
Average B-factors (Å²)                    15.8        19.4                15.8
Crossvalidated σA coordinate error (Å)    0.10        0.14                0.11



$R_{cryst} = \sum \big| |F_o| - |F_c| \big| / \sum |F_o|$, where $F_o$ and $F_c$ are the observed and calculated structure factors,
respectively. For $R_{free}$, the sum is extended over a subset of reflections (~10%) excluded from
all stages of refinement.
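As a purely illustrative sketch of the definitions given in the footnote to Table 10, the R-factor can be evaluated over any set of reflections, and Rfree is the same quantity restricted to a test set withheld from refinement. The function names `r_factor` and `split_free_set`, the random selection of the test set and the fixed seed are assumptions made for this example; they do not describe the refinement program actually used.

```python
import random
from typing import List, Sequence, Tuple


def r_factor(f_obs: Sequence[float], f_calc: Sequence[float]) -> float:
    """Return sum ||Fo| - |Fc|| / sum |Fo| over the supplied reflections."""
    if len(f_obs) != len(f_calc):
        raise ValueError("f_obs and f_calc must have the same length")
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    denominator = sum(abs(fo) for fo in f_obs)
    if denominator == 0:
        raise ValueError("sum of |Fo| is zero; R-factor is undefined")
    return numerator / denominator


def split_free_set(n_reflections: int, free_fraction: float = 0.10,
                   seed: int = 0) -> Tuple[List[int], List[int]]:
    """Randomly assign reflection indices to a working set and a free set.

    Returns (work_indices, free_indices). The free set is withheld from
    refinement and used only to evaluate Rfree.
    """
    rng = random.Random(seed)
    indices = list(range(n_reflections))
    rng.shuffle(indices)
    n_free = round(n_reflections * free_fraction)
    return sorted(indices[n_free:]), sorted(indices[:n_free])


# Toy example: |1| + |2| + |0| = 3, divided by 10 + 20 + 30 = 60.
toy_r = r_factor([10.0, 20.0, 30.0], [9.0, 22.0, 30.0])
work, free = split_free_set(100)
```

Rcryst would then be `r_factor` evaluated over the working indices and Rfree the same function evaluated over the free indices.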